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A gene switch comprising an ecdysone receptor 

The present invention relates to the identification and characterisation of insect steroid 
receptors from the Lepidoptera species Heliothis virescens, and the nucleic acid encoding 
5 therefor. The present invention also relates to the use of such receptors, and such nucleic acid, 
particularly, but not exclusively, in screening methods, and gene switches. 
By gene switch we mean a gene sequence which is responsive to an applied exogenous 
chemical inducer enabling external control of expression of the gene controlled by said gene 
sequence. 

10 Lipophilic hormones such as steroids induce changes in gene expression to elicit 

profound effects on growth, cellular differentiation, and homeostasis. These hormones 
recognise intracellular receptors that share a common modular structure consisting of three 
main functional domains: a variable amino terminal region that contains a transactivation 
domain, a DNA binding domain, and a ligand binding domain on the carboxyl side of the 

15 molecule. The DNA binding domain contains nine invariant cysteines, eight of which are 
involved in zinc coordination to form a two-finger structure. In the nucleus the 
hormone-receptor complex binds to specific enhancer-like sequences called hormone response 
elements (HREs) to modulate transcription of target genes. 

The field of insect steroid research has undergone a revolution in the last three years 

20 as a result of the cloning and preliminary characterisation of the first steroid receptor member 
genes. These developments suggest the time is ripe to try to use this knowledge to improve 
our tools in the constant fight against insect pests. Most of the research carried out on the 
molecular biology of the steroid receptor superfamily has been on Drosophila melanogaster 
(Diptera), see for example International Patent Publication No W09 1/13167, with some in 

25 Manduca and Galleria (Lepidoptera). 

It has been three decades since 20-hydroxyecdysone was first isolated and shown to 
be involved in the regulation of development of insects. Since then work has been carried out 
to try to understand the pathway by which this small hydrophobic molecule regulates a 
number of activities. By the early 1970s, through the studies of Clever and Ashbumer, it was 

30 clear that at least in the salivary glands of third instar Drosophila larvae, the application of 
ecdysone lead to the reproducible activation of over a hundred genes. The ecdysone receptor 
in this pathway is involved in the regulation of two classes of genes: a small class (early 
genes) which are induced by the ecdysone receptor and a large class (late genes) which are 
repressed by the ecdysone receptor. The early class of genes are thought to have two 

35 functions reciprocal to those of the ecdysone receptor, the repression of the early transcripts 
and the induction of late gene transcription. Members of the early genes so far isolated and 
characterised belong to the class of molecules with characteristics similar to known 
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^nscnpUon factors They are thus predicted to behave as expected by the mode, of ecdysone 
acuon (Ashbumer, 1991). More recently, the early genes E74 and E75 have been showTto 

1991) .thussupportingtheirproposeddualactivities. It should be noted however thatthe 
. acavauonofanierarchyofgenesisnotlirnitedto third instar larvae salivary glandT It 
the response to the ecdysone peak at the end of larva, life is observed in nZy other ZT 
such as the tmaginal disks (i.e. those tissues which metamorphose to adult stl^s) aid 
o*er larval issues which histolyse at the end of larval life (eg. larva, fat body). W for 
ecdysone act™ as deduced by studying the third instar chromosome puffing may no 7 aPP ' 

x: f 2::: n *:? r: regulated genes * * °^ — *• ~" 

«he factors in addition to the active ecdysone receptor must be satisfied for correct 
developmenul expression (e.g. the DrosopMa yolk prote in gene expression in adults is 
under control of doublesex, the last gene in the sex determination gene hierarchy) 

™ ec <* s ° ne ^ptor and the early gene E75 belong to the steroid receptor 
^rfarmly O^nrosopHila genes, including ultraspiracle, tailless, sevenup aT^-H 
also belong to this family. However, of all these genes only the ecdysone receptor isTo™ 
to have a hgand, and thus the others are known as orphan receptors J^ £Z? 
__*eultr^^ __ _ 

retinoic X receptor (RXR) hgand binding region (Ore et al., 1990), they do not s^ the 
same hgand ( ,e. the RXR ligand is 9-cis retinoic acid) (Heymann et al.. 1992 1 
Mangelsdorfetal.,1992). All the Drosophila genes mentioned are involved in development 
ul^racle for example, is recruited for embryonic and larva, abdominal Jl^T 

ffvans, 1988, Green and Chambon, 1988, Beato, 1989) i.e. they have a variable N terminus 
STZ V ^ indePendent ttanSaCtiVaIi0n A and B). a highly c« 

^ol^)\t re81 ° nWhi f -specific Ls 

£d7w n ! re81 ° n to ^ 3 nUdCar ttanslocation ««- domain D) 

and the dimensauon phase (Domain E) The last m ™ * c- , 

its function is unknown. * ' ^ * " *° Veiy VariaWe and 

Steroid receptor action has been elucidated in considerable detail in vertebrate svste™ 

K ^sTZTT dm0l r MS - ta *- b — -^-erecep^r ^ 
resides inthe cytoplasm where it is bound by Hsp90. Hs P 70, and P 59 to form the inactive 

hZeTT 1988> - UP<M1 ^ ° f UgSWd M ^ "e-ceptoraco^onal 
seen in the hgand binding domain of both progesterone and retinoic acid receptor, (Allan et 
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al., 1992a). This conformational change has been further characterised in the progesterone 
receptor and was found to be indispensable for gene transactivation (Allan et al., 1992b). 
Once inside the nucleus the receptor dimer binds to the receptor responsive element at a 
specific site on the DNA resulting in the activation or repression of a target gene. The 
5 receptor responsive elements usually consist of degenerate direct repeats, with a spacer 

between 1 and 5 nucleotides, which are bound by a receptor dimer through the DNA binding 
region (Domain C). 

Whereas some steroid hormone receptors are active as homodimers others act as 
heterodimers. For example, in vertebrates, the retinoic acid receptor (RAR) forms 

10 heterodimers with the retinoic X receptor (RXR). RXR can also form heterodimers with 
the thyroid receptor, vitamin D receptor (Yu et al., 1991; Leid et al., 1992) and peroxisome 
activator receptor (Kliewer et al., 1992). Functionally the main difference between 
homodimers and heterodimers is increased specificity of binding to specific response 
elements. This indicates that different pathways can be linked, co-ordinated and modulated, 

15 and more importantly this observation begins to explain the molecular basis of the pleotropic 
activity of retinoic acid in vertebrate development (Leid et al., 1992b). Similarly, the 
Drosophila ultraspiracle gene product was recently shown to be capable of forrning 
heterodimers with retinoic acid, thyroid, vitamin D and peroxisome activator receptors and to 
stimulate the binding of these receptors to their target responsive elements (Yao et al., 1993). 

20 More significantly, the ultraspiracle gene product has also been shown to form heterodimers 
with the ecdysone receptor, resulting in cooperative binding to the ecdysone response element 
and capable of rendering mammalian cells ecdysone responsive (Yao et al., 1992). The latter 
is of importance since transactivation of the ecdysone gene alone in mammalian cells fails to 
elicit an ecdysone response (Koelle et al., 1991), therefore suggesting that the ultraspiracle 

25 gene product is an integral component of a functional ecdysone receptor (Y ao et al., 1992). 
It is possible that the ultraspiracle product competes with other steroid receptors or factors 
to form heterodimers with the ecdysone receptor. Moreover it remains to be investigated if 
ultraspiracle is expressed in all tissues of the Drosophila larvae. Despite ultraspiracle being 
necessary to produce a functional ecdysone receptor, the mechanism by which this activation 

30 takes place is as yet undetermined. 

We have now isolated and characterised the ecdysone steroid receptor from Heliothis 
virescens (hereinafter HEcR). We have found that surprisingly unlike the Drosophila 
ecdysone steroid receptor (hereinafter DEcR), in reports to-date, HEcR can be induced by 
known non-steroidal inducers. It will be appreciated that this provides many advantages for 

35 the system. 

Steroids are difficult and expensive to make. In addition, the use of a non-steroid as 
the inducer allows the system to be used in agrochemical and pharmaceutical applications, not 
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least because it avoids application of a steroid which is already present in insects and/or 
mammals. For example, it would not be feasible to use a gene switch in a mammalian cell 
wmch was tnduced by a naturally occurring steroidal induce, It will also be appreciated that 
for envu-onmental reasons it is advantageous to avoid the use of steroids as inducers 

Accordmg to one aspect of the present invention there is provided DNA having the 
sequence shown „ Seq ID No. 2, wherein Seq ID No 2 gives the sequence for the HEcR 

According to another aspect of the present invention there is provided DNA having 

EH SeqUeDCe Sh0W ° " ^ m 2 ' ^ enC ° deS f0f 016 *** bindin g 
10 „ art ft C CCO ^ ngtOM ° Aer ^ tOftheprcsent ^ venti0n the re 

SCqUenCe Sh ° Wn ^ 10 2 ' WhiCh enCOdeS f ° r m ° R DNA bindin S 

According to yet another aspect of the present invention there is provided DNA 
having part of the sequence shown in Seq ID No. 2, which encodes for the HEcR 

15 transactivation domain. 

nart f ,C CC ° rdinS IO 3 351,601 ° f PrCSent inVention there is DNA having 

part of the sequence shown in Seq ID No. 2, which encodes for the HEcR hinge domain 

- - --- -Accordmg toastill further aspect of the present mvenuon there is provrded DNA 

secuen AC H 0rdinS 1 ^ ° f PreSeDt inVCn,i0n toB iS P TOVided DN A having the 
sequence shown tn Seq ID No. 3, wherein Seq ID No 3 gives the sequence for the HEcR 

nart of * ,0 r° ,her MPeCt ° f ^ PR!Sent iOVen,ion «"« is P™4* DNA having 

25 Sol * eSeqUenCeSh0Wn *^^^ 

Dart of ^ COrdin8 1 ° am>ther MPBCt ° f ^ PreSeM iDVenti0n *« is P rovid ^ DNA having 
Zl S6qUenCe ^ " ^ m NO " * WhiCh enC ° deS fOT *" ™* DNA binding 

According to yet another aspect of the present invention there is provided DNA 
havtng part of the sequence shown in Seq ID No. 3, which encodes for the HEcR 
transactivation domain. 

nart of ,f COrding t0 4 flBtber ° f ** PreSent inVenti0n *« is P™*" DNA having 
part of the sequence shown in Seq ID No. 3, which encodes for the HEcR hinge domain 

According to a still further aspect of the present invention there is provided DNA 

havmg part of the sequence shown in Seq ID No. 3, which encodes for the HEcR carboxy 

terminal region. J 
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According to one aspect of the present invention there is provided DNA having the 
sequence shown in Seq ID No. 4, wherein Seq ID No 4 gives the sequence for the HEcR. 

According to another aspect of the present invention there is provided DNA having 
part of the sequence shown in Seq ID No. 4, which encodes for the HEcR ligand binding 
5 domain. 

According to another aspect of the present invention there is provided DNA having 
part of the sequence shown in Seq ID No. 4, which encodes for the HEcR DNA binding 
domain. 

According to yet another aspect of the present invention there is provided DNA 
10 having part of the sequence shown in Seq ID No. 4, which encodes for the HEcR 
transactivation domain. 

According to a further aspect of the present invention there is provided DNA having 
part of the sequence shown in Seq ID No. 4, which encodes for the HEcR hinge domain. 
According to a still further aspect of the present invention there is provided DNA 
15 having part of the sequence shown in Seq ID No. 4, which encodes for the HEcR carboxy 
terminal region. 

As mentioned above, steroid receptors are eukaryotic transcriptional regulatory 
factors which, in response to the binding of the steroid hormone, are believed to bind to 
specific DNA elements and activate transcription. The steroid receptor can be divided into six 

20 regions, designated A to F, using alignment techniques based on shared homology with other, 
members of the steroid hormone receptor superfamily. Krust et al identified two main regions 
in the receptor, C and E. Region C is hydrophilic and is unusual in its high content in cysteine, 
lysine and arginine.lt corresponds to a DNA-binding domain, sometimes referred to as the 
"zinc finger". It is the DNA binding domain which binds to the upstream DNA of the 

25 responsive gene. Such upstream DNA is known as the hormone response element or HRE for 
short. Region E is hydrophobic and is identified as the hormone (or ligand) binding domain. 
Region E can be further subdivided into regions El, E2 and E3. 

The region D, which separates domains C and E is highly hydrophobic and is flexible. 
It is believe that communication between domains E and C involves direct contact between 

30 them through region D, which provides a hinge between the two domains. Region D is 
therefore referred to as the hinge domain. 

The mechanism of the receptor appears to require it to interact with some elements) 
of the transcription machinery over and above its interactions with the hormone and the 
hormone response element N-terminal regions A and B perform such a function and are 

35 jointly known as the transactivation domain. The carboxy terminal region is designated F. 
The domain boundaries of the HEcR can be defined as follows: 
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DOMAIN 

Transactivating (A/Bl 
DNA Binding (C\ 
Hinge (D) 



Ligand Binding (E) 
C-Terminal End (F) 



base pairs 
114-600 
601-798 
799-1091 
1092-17S7 
1758-1844 



INTERVALS 



amino acids 
1-162 
163-228 
229-326 
327-545 
546-577 



The DNA binding domain is v ei y well defined and is 66 amino acids long thus 
providing good boundaries. The above intervals have been define* using the 2T 
' alignment for the ecdysone receptors (Figure 5). e multiple 

The present invention also includes DNA which shows homology to the sequences of 
represent invention. Typically homology is shown when 60% or more* the nudeddes Z 

80% or 85%. especially preferred are 90%, 95%. 98% or 99% or more homology 

do^NA COdCS " ^ ^ «<* -eptor o 

domauvDNA binding domain, hinge domain, ligand binding domain and/or carboxy tenZT 
region ^ such hybridisation ^ ^ of terminai 

conditions. In general terms, low stringency conditions can be defined as 3 TsccTt about 

65 C. SSC is the name of a buffer of 0.15M NaCl. 0.015M trisodium citrate 3 x SSC k h 
time as strong as SSC and so on. SC ,S 111166 

™;P re * nt ^ntion furmer^^ 
genetic code to the DNA of the present invention and which codes for a polyTpdde wWch is 

domainjunge domain, ligand binding domain and/or carboxy terminal region 

The DNA of the present invention may be cDNA or DNA which is in an isolated form 
According to another aspect of the present invention there is provided a poIvpep.il 

freC fr ° m 0ther P roteins which it is ordinarily associated and wl** is 
coded for by any of the DNA of the present invention 

which hiT 1 " 8 10 T 3SPeCt ° f ^ PreSCnt teventibn ^ - Prided a polypeptide 
which has the amino acid sequence of Seq ID No. 4 or any allelic variant or oerivatiT 
thereof, wherein Seq ID No. 4 gives the amino acid sequence of the HEcR polypepdL 
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According to another aspect of the present invention there is provided a polypeptide 
which has part of the amino acid sequence of Seq ID No. 4 or any allelic variant or derivative 
thereof, which sequence provides the HEcR ligand binding domain. 

According to another aspect of the present invention there is provided a polypeptide 
5 which has part of the amino acid sequence of Seq ID No. 4 or any allelic variant or derivative 
thereof, which sequence provides the HEcR DNA binding domain. 

According to yet another aspect of the present invention there is provided a 
polypeptide which has part of the amino acid sequence of Seq ID No. 4 or any allelic variant 
or derivative thereof, which sequence provides the HEcR transactivation domain. 
10 According to a further aspect of the present Invention there is provided a polypeptide 

which has the amino acid sequence of a part of Seq ID No. 4 or any allelic variant or 
derivative thereof, which sequence provides the HEcR hinge domain. 

According to a still further aspect of the present invention there is provided a 
polypeptide which has the amino acid sequence of a part of Seq ID No. 4 or any allelic variant 
15 or derivative thereof, which sequence provides the HEcR carboxy terminal region. 

For the avoidance of doubt, spliced variants of the amino acid sequences of the 
present invention are included in the present invention. 

Preferably, said derivative is a homologous variant which has conservative amino acid 
changes. By conservation amino acid changes we mean replacing an amino acid from one of 
20 the amino acid groups, namely hydrophobic, polar, acidic or basic, with an amino acid from 
within the same group. An examples of such a change is the replacement of valine by 
methionine and vice versa. 

According to another aspect of the present invention there is provided a fusion 
polypeptide comprising at least one of the polypeptides of the present invention functionally 
25 linked to an appropriate non-Heliothis ecdysone receptor domain(s). 

According to an especially preferred embodiment of the present invention the HEcR 
ligand binding domain of the present invention is fused to a DNA binding domain and a 
transactivation domain. 

According to another embodiment of the present invention the DNA binding domain is 
30 fused to a ligand binding domain and a transactivation domain. 

According to yet another embodiment of the present invention the transactivation 
domain is fused to a ligand binding domain and a DNA binding domain. 

The present invention also provides recombinant DNA encoding for these fused 
polypeptides. 

35 According to an especially preferred embodiment of the present invention there is 

provided recombinant nucleic acid comprising a DNA sequence encoding the HEcR ligand 
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35 



binding domain functionally linked to DNA encoding the DNA binding domain and 
transactivation domain from a glucocorticoid receptor. 

According to yet another aspect of the present invention there is provided 
recombinant nucleic acid comprising a DNA sequence comprising a reporter gene operably 
hnked to a promoter sequence and a hormone response element which hormone response 
element is responsive to the DNA bonding domain encoded by the DNA of of the present 
invention. y 

According* another aspect of the present invention there is provided a construct 
^formed with nucleic acid, recombinant DNA, a polypeptide or a fusion polypeptide of the 

~« f" 7"°? COnStfUCtS iDClUde P,aSmidS " d ^ SUi * We *» «Lng a 
cell of mterest. Such constructs will be well known to those skilled in the art 

According to another aspect of the present invention there is provided a cell 
transformed with nucleic acid, recombinant DNA, a polypeptide, or a fusion polypeptide of 
the present invention. jf^uucui 

15 Preferably the cell is a plant, fungus or mammaUan cell. 

For the avoidance of doubt fungus includes yeast. 

The present invention therefore provides a gene switch which is operably linked to a 
„__fo^gn.gene oraseries of-foreign genes whereby expression or said ^ 

senes of foreign genes may be controlled by application of an effective exogenous inducer 
Analogs of ecdysone, such as Muristerone A, are found in plants and disrupt the ' 
development o insects. It is therefore proposed that the receptor of the present invention can 
be used be ,n plants transformed therewith as an insect control mechanism. The production of 
me msect-damagmg product being controlled by an exogenous inducer. The insect-damagin 
g product can be ecdysone or another suitable protein. 

5 rh 5 j^r" st ^ 

RH-5849 [1.2^benzoyl, 1 -ten-butyl hydrazide], which is commercially available as an 
msecuc.de from Rohm and Haas, were described back in 1988. Another commercially 
avaikble compound in this series is RH-5992 [tebufenozide, 3,5-dimethylbenzoic acid 1-1 
(l.l-dimethylethyD-aC^thylbenroyl) hydrazide]. These compounds mimic 
• 20-hydroxyecdysone (20E) in both Mmduca sexta and D rosophOa melanogaste, These 
compounds have the advantage that they have the potential to control insects using 
ecdysteroid agonists which are non-steroidal. Further Examples of such dibenzoyl hydrazines 

?^aZ T M Na - 5 ' 1 17 '° 57 10 Rohm aBd Haas ' 311(1 0ikawa et * ™< s - «• 

139-148 (1994). However, it wdl be appreciated that any inducer of the gene switch of the 
present mvenuon. whether steroidal or non-steroidal, and which is currently or becomes 
available, may be used. es 
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The gene switch of the present invention, then, when linked to an exogenous or 
foreign gene and introduced into a plant by transformation, provides a means for the external 
regulation of expression of that foreign gene. The method employed for transformation of the 
plant cells is not especially germane to this invention and any method suitable for the target 
5 plant may be employed. Transgenic plants are obtained by regeneration from the transformed 
cells. Numerous transformation procedures are known from the literature such as 
agroinfection using Agrobacterium twnefaciens or its Ti plasmid, electroporation, 
microinjection or plants cells and protoplasts, microprojectile transformation, to mention but a 
few. Reference may be made to the literature for full details of the known methods. 

10 Neither is the plant species into which the chemically inducible sequence is inserted 

particularly germane to the invention. Dicotyledonous and monocotyledortous plants can be 
transformed. This invention may be applied to any plant for which transformation techniques 
are, or become, available. The present invention can therefore be used to control gene 
expression in a variety of genetically modified plants, including field crops such as canola, 

15 sunflower, tobacco, sugarbeet, and cotton; cereals such as wheat, barley, rice, maize, and 
sorghum; fruit such as tomatoes, mangoes, peaches, apples, pears, strawberries, bananas and 
melons; and vegetables such as carrot, lettuce, cabbage and onion. The switch is also suitable 
for use in a variety of tissues, including roots, leaves, stems and reproductive tissues. 

In a particularly preferred embodiment of the present invention, the gene switch of the 

20 present invention is used to control expression of genes which confer resistance herbicide 
resistance and/or insect tolerance to plants. 

Recent advances in plant biotechnology have resulted in the generation of transgenic 
plants resistant to herbicide application, and transgenic plants resistant to insects. Herbicide 
tolerance has been achieved using a range of different transgenic strategies. One well 

25 documented example in the herbicide field is the use the bacterial xenobiotic detoxifying gene 
phosphinothricin acetyl transferase (PAT) from Streptomyces hydroscopicus. Mutated genes 
of plant origin, for example the altered target site gene encoding acetolactate synthase (ALS) 
from Arabidopsis, have been successfully utilised to generate transgenic plants resistant to 
herbicide application. The PAT and ALS genes have been expressed under the control of 

30 strong constitutive promoter. In the field of insecticides, the most common example to-date is 
the use of the Bt gene. 

We propose a system where genes conferring herbicide and/or insect tolerance would 
be expressed in an inducible manner dependent upon application of a specific activating 
chemical. This approach has a number of benefits for the farmer, including the following: 

35 1 . Inducible control of herbicide and/or insect tolerance would alleviate any risk of yield 
penalties associated with high levels of constitutive expression of herbicide and/or 
insect resistance genes. This may be a particular problem as early stages of growth 
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where high levels of transgene product may directly interfere with normal 
deve opment Alternatively high levels of expression of herbicide and/or ^ 
res,stance genes may cause a metabolic drain for plant resources 

2. ^he expression of herbicide resistance genes in an inducible manner allows the 

onted d " l ° ^ ^ l ° C0Mr01 V ° 1UntterS * * e chemica. is 

omitted during treatment. 

3. The use of an inducible promoter to drive herbicide and/or insect resistance genes wiU 
^uce the risk of resistance becoming a major problem. If resistance gen" 

ZSr'rt^ *" rel3ted ^ C ° M «" * ^ with the 

TT" ° f ™ S WOUld P«y be relevant if 

the olerance gene confirmed resistance to a total vegetative control herbicide which 
would be used (with no inducing chemical) prior to sowing the crop and pl^y 
after the crop has been harvested. For example, it can be envisaged that^ 
-stance cereals, such as wheat, might outcross into the weed wild oa« I 

*at the inducible expression of herbicide resistance in sugar beet will reduce the risk 
of wild sugar beet becoming a problem. Similarly. i„ the field of msect control iJect 

T « T ^ beC ° mea Prob,em if ^* et ^^»constitutivelyexpreS L - 

control mechanisms to be employed. 

This strategy of inducible expression of herbicide resistance can be achieved with a 
P-spray of chemical activator or in the case of slow acting herbicides, for elanlpTe N 
Phosphonomethyl-glycine (commonly known as glyphosate). the chemical induT^ 

herbicide^irT Z u- * "* gene/corresponding 

herbicide combmauon, which is. or becomes, available. For example . the gene switch „f 7 
present invention can be used with: 8 ltch ° f 

1. Maize gfctathione S-transferase (GST-27) gene (see our Internationa, Patent 
P^cauon No WO90/08826). which confers resistance to chloroacetan^i 
hertncides such as acetochlor, metolachlor and alachlor 

2- Phosphinotricin acetyl transferase (PAT), which confers resistance to the herbicide 
commonly known as glufosinate. 

3- Acetolactate synthase gene mutants from maize (see our International Patent 
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4. Genes which confer resistance to glyphosate. Such genes include the glyphosate 

oxidoreductase gene (GOX) (see International Patent Publication No. WO92/00377); 
genes which encode for 5^nolpynivyl-3-phosphoshikimic acid synthase (EPSPS), 
including Class I and Class II EPSPS, genes which encode for mutant EPSPS, and 
5 genes which encode for EPSPS fusion peptides such as that comprised of a . 

chloroplast transit peptide and EPSPS (see for example EP 218 571, EP 293 358, 
WO91/04323, WO92/04449 and WO92/06201); and genes which arc involved in the 
expression of CPLyase. 

Similarly, the strategy of inducible expression of insect resistance can be adopted for 

10 any tolerance confering gene which is, or becomes, available. 

The gene switch of the present invention can also be used to controlled expression of 
foreign proteins in yeast and mammalian cells. Many heterologous proteins for many 
applications are produced by expression in genetically engineered bacteria, yeast cells and 
other eucaryotic cells such as mammalian cells. 

15 As well as the obvious advantage in providing control over the expression of foreign 

genes in such cells, the switch of the present invention provides a further advantage in yeasts 
and mammalian cells where accumulation of large quantities of an heterologous protein can 
damage the cells, or where the heterologous protein is damaging such that expression for 
short periods of time is required in order to maintain the viability of the cells. 

20 Such an inducible system also has applicability in gene therapy allowing the timing of 

expression of the therapeutic gene to be controlled. The present invention is therefore not 
only applicable to transformed rnarnmalian cells but also to mammals per se. 

A further advantage of the inducible system of the present invention in mammalian 
cells is that, because it is derived from a insect, there is less chance of it being effected by 

25 inducers which effect the natural mammalian steroid receptors. 

In another aspect of the present invention the gene switch is used to switch on genes 
which produce potentially damaging or lethal proteins. Such a system can be employed in the 
treatment of cancer in which cells are transformed with genes which express proteins which 
are lethal to the cancer. The timing of the action of such proteins on the cancer cells can be 

30 controlled using the switch of the present invention. 

The gene switch of the present invention can also be used to switch genes off as well 
as on. This is useful in disease models. In such a model the cell is allowed to grow before a 
specific gene(s) is switched off using the present invention. Such a model facilitates the study 
of the effect of a specific gene(s). 

35 Again the method for producing such transgenic cells is not particularly germane to 

the present invention and any method suitable for the target cell may be used; such methods 
are known in the art, including cell specific transformation. 
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As previously mentioned, modulation of ge „ e expression in the system appears in 
response to the binding of the HEcR to a specific control, or regulatory. DNA element A 
schematic representation of the HEcR gene switch is shown in Figure 6. For ease of 
reference, the schematic representation only shows three main domains of the HEcR namelv 
. the transactivation domain, DNA binding domain and the ligand binding doma^^a 
bgand to the hgand binding domain enables the DNA binding domain to bind to the^HRE 
resulting m expression (or indeed repression) of a target gene. 

The gene switch of the present invention can therefore be seen as having two 
components. The first component comprising the HEcR and a second component comprisina 
an^ppropnate HRE and the target gene. In practice, the switch may convenient JZT* 
form of one or two sequences of DNA. At least par, of the one sequence, or one sequent of 
*e pa,, encodmg the HEcR protein. Alternatively, the nucleic acid encoding the HEcRcan 
be replaced by the protein/ polypeptide itself. M 
Not only does the switch of the present invention have two components but also one 
or more of the domains of the receptor can be varied producing a chimeric gene swlt The 
switch of the present invention is very flexible and different combinations 1 be3 m olr 

The glucocorticoid steroid receptor is well characterised and has been found to work 
well m piants. A further advantage of this receptor is that it functions as a homodimt TOs 
means that .here is no need to express a second protein such as the ultraspi^letnT^o 
pnrfuce a funcuona, receptor. The problem with the glucocorticoid steJd receptor *tn* 
ligands used to activate it are not compatible with agronomic practice 

receotorD^^ 

Tcorle Th! 8 trMSaCtlVSUi ° n *»"*» with * "land binding domain 

accordmg to the present mvenuon. The response unit preferably comprising the 

gIucocoruco,d hormone response element and the desired effect gene. In the Examnles for 
convemence this effect gene took the form of a reporter gene. Holeve, Tn^ Z 
scree, stations the gene will be the gene which produces the desired effect, for ex2,e 
produces fce desired protein. This protein may be a natural or exogenous protein 12 be 
appreaated that this chimeric switch combines the best features of the gluLoL 
whilst overcoming the disadvantage of only being inducible by a steroid 

In anomerpreferr^embodiment,th e ^/ io , Al - J Ugand bmding domain is changed 

e^fr h yrcPl "f WithaM ^ ^--eptorngandbmdmgdoml^or 
example, we have elated suitable sequences from Spodoptera exigua. 
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Thus, according to another aspect of the present invention there is provided DNA 
having the sequence shown in Seq ID No. 6. 

According to another aspect of the present invention there is provided DNA having 
part of the sequence shown in Seq ID No. 6, which encodes for the Spodoptera ecdysone 
5 ligand binding domain. 

According to another aspect of the present invention there is provided DNA having 
part of the sequence shown in Seq ID No. 6, which encodes for the Spodoptera ecdysone 
hinge domain. 

The present invention also provides the polypeptides coded for by the above DNA 
10 sequences of Seq ID No. 6. 

A further advantage with such chimeric systems is that they allow you to choose the 
promoter which is used to drive the effector gene according to the desired end result. For 
example, placing the foreign gene under the control of a cell specific promoter can be 
particularly advantageous in circumstances where you wish to control not only the timing of 
15 expression, but also which cells expression occurs in. Such a double control can be 
particularly important in the areas of gene therapy and the use of cytotoxic proteins. 

Changing the promoter also enables gene expression to be up- or down-regulated as 
desired. 

Any convenient promoter can be used in the present invention, and many are known in 

20 the art. 

Any convenient transactivation domain may also be used. The transactivation domain 
VP 16 is a strong activator from Genentech Inc., and is commonly used when expressing 
glucocorticoid receptor in plants. Other transactivation domains derived for example from 
plants or yeast may be employed. 

25 In a preferred embodiment of the present invention, the DNA binding domain is the 

glucocorticoid DNA binding domain. This domain is commonly a human glucocorticoid 
receptor DNA binding domain. However, the domain can be obtained from any other 
convenient source, for example, rats. 

According to another aspect of the present invention there is provided a method of 

30 selecting compounds capable of being bound to an insect steroid receptor superfamily member 
comprising screening compounds for binding to a polypeptide or fusion polypeptide of the 
present invention, and selecting said compounds exhibiting said binding. 

According to another aspect of the present invention there is provided a compound 
selected using the method of the present invention. 

35 According to another aspect of the present invention there is provided an agricultural 

or pharmaceutical composition comprising the compound of the present invention. 
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According to yet another aspect of the present invention there is provided the use of 
the compound of the present invention as a pesticide, pharmaceutical and/or inducer of the 
switch. It will be appreciated that such inducers may well be useful as insecticides in 
themselves. 

5 According to a further aspect of the present invention there is provided a method of 

producing a protein or peptide or polypeptide comprising introducing into a cell of the present 
invention, a compound which binds to the ligand binding domain in said cell. 

Various preferred features and embodiments of the present invention will now be 
described by way of non-limiting example with reference to the accompanying examples and 
10 figures, in which figures: 

Figure 1 (Sequence ID No. 1) shows the DNA sequence amplified from first strand 
cDNA made from mRNA isolated from Heliothis virescens Fourth instar larvae. The 
underlined sequences refer to the position of the degenerate oligonucleotides. At the 5' end 
the sequence matches that of the oligonucleotide while at the 3' end 12 nucleotides of the 
15 original oligonucleotide are observed; 

Figure 2 (Sequence ID No. 2) shows the DNA sequence contained within the clone 
PSK19R isolated from a random primed cDNA Heliothis virescens library; Sequence is 
flanked by EcoRI sites; 

Figure 3 (Sequence ID No. 3) shows the DNA sequence contained within the clone 
20 pSKl 6. 1 isolated from a random primed cDNA Heliothis virescens library ; 

Figure 4 (Sequence ID No. 4) DNA sequence of 5'RACE products (in bold) fused to 
sequence of clone P SK16.1. The ORF (open reading frame) giving rise to the Heliothis 
virescens ecdysone receptor protein sequence is shown under the corresponding DNA 
sequence; 

25 Figure 5 (Sequence ID No. 5) shows the protein sequence alignment of the ecdysone 

receptors DmEcR (Drosophila melanogaster), CtEcR (Chironomus tentans), BmEcR 
(Bombyx mori), MsEcR (Manduca sexta), AaEcR (Aedes aegipti) and HvEcR (Heliothis 
vtrescens). indicates conserved amino acid residue. - - indicates a conservative amino 
acid exchange; 

Figure 6 shows a model of an embodiment of the glucocorticoid/^/,*/** ecdysone 
chimeric receptor useable as a gene switch; 

Figure 7 shows a plasmid map of the clone pcDNA319R. The three other mammalian 
expression vectors were constructed in the same way and look similar but for the size of the 
insert; 

Figure 8 shows a plasmid map of the reporter construct used to analyse the activity of 
the Heliothis virescens ecdysone receptor, 



30 
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Figure 9 is a graph which shows the effect of Muristerone A and RH5992 in reporter 
activity in HEK293 cells co-transfected with pcDN A3H3 KHEcR alone (filled bars) or with 
(XRXR (stripped bars); 

Figure 10 shows a plasmid map of the Maize expression vector containing the 
5 Glucocorticoid receptor (HG1 or pMF6HGlPAT); 

Figure 1 1 shows a plasmid map of the maize expression vector containing the chimeric 
glucocorticoid/Dro sophila ecdy sone receptor pMF6GREcRS ; 

Figure 12 shows a plasmid map of the maize expression vector containing the chimeric 
%\ucocoT\ico\6JHeliothis ecdysone receptor pMF6GRHEcR; 
10 Figure 13 shows a plasmid map of the plant reporter Plasmid containing the 

glucocorticoid response elements fused to the -60 S35CaMV promoter fused to GUS, 
p221.9GRE6; 

Figure 14 shows a plasmid map of the plant reporter plasmid containing the 
glucocorticoid response elements fused to the -46 S35CaMV promoter fused to GUS, 
15 p221.10GRE6; 

Figure 15 shows a graph showing the effect of Muristerone A and Dexamethasone in 
Maize AXB protoplasts transformed with pMF6HGlPAT (GR) and p221.9GRE6 (reporter); 

Figure 16 shows a graph showing the effect of Muristerone A and Dexamethasone in 
Maize AXB protoplasts transformed with pMF6GREcRS (effector) and p221.9GRE6 
20 (reporter); 

Figure 17 shows a graph showing the effect of Muristerone A and Dexamethasone in 
Maize AXB protoplasts transformed with pMF6GRHEcR (effector) and p221.9GRE6 
(reporter); 

Figure 18 shows a graph showing the effect of RH5849 in Maize AXB protoplasts 
25 transformed with pMF6GREcRS (effector) and p221.9GRE6 (reporter); 

Figure 19 shows a graph showing the effect of RH5992 in Maize AXB protoplasts 
transformed with pMF6GREcRS (effector) and p221.9GRE6 (reporter); 

Figure 20 shows a graph showing the effect of RH5992 in Maize AXB protoplasts 
transformed with pMF6GRHEcR (effector) and p221.9GRE6 (reporter); 
30 Figure 2 1 shows a graph which shows the dose response effect of RH5992 in Maize 

AXB protoplasts transformed with pMF6GRHEcR (effector) and p221.9GRE6 (reporter); 

Figure 22 shows a plasmid map of the tobacco expression vector containing the 
chimeric glucocorticoid/ Drosophila ecdysone receptor, pMF7GREcRS; 

Figure 23 shows a plasmid map of the tobacco expression vector containing the 
35 chimeric glucocorticoid/ Heliothis ecdysone receptor, pMF7GRHEcR; 

Figure 24 shows a graph which shows the effect of RH5992 in Tobacco mesophyll 
protoplasts transformed with pMF6GRHEcR (Effector) and p221.9GRE6 (reporter); 
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Figure 25 shows a plasmid map of the mammalian expression vector containing the 
chimeric glucocorticoid///e//o/Aw ecdysone receptor, pcDNA3GRHEcR; 
Figure 26 shows a plasmid map of the reporter plasmid pSWGRE4; 
Figure 27 shows a graph which shows a RH5992 dose response curve of CHO cells 
5 transfected with pcDNA3GRHEcR and pSWGRE4; 

Figure 28 shows a graph which shows the effect of Muristerone A and RH5992 on 
HEK293 cells co-transfected with pcDNA3GRHEcR and pSWGRE4; 
Figure 29 shows a plasmid map of the binary vector ES 1 ; 
Figure 30 shows a plasmid map of the binary vector ES2; 
1 0 Figure 3 1 shows a plasmid map of the binary vector ES3 ; 

Figure 32 shows a plasmid map of the binary vector ES4; 

Figure 33 shows a plasmid map of the effector construct TEV-B 1 12 made to express 
the HEcR ligand binding domain in yeast; 

Figure 34 shows a plasmid map of the effector construct TEV8 made to express the 
15 HEcR ligand binding domain in yeast; 

Figure 35 shows a plasmid map of the effector construct TEWP16-3 made to express 
the HEcR ligand binding domain in yeast; 

Figure 36 shows aplasmid map of the mammalian expreTsi5n- V eclor "containing the" ~ 

chimeric glucocorticoid VPWHeliothis ecdsysone receptor, pcDNA3GRVP16HEcR- 

Figure 37 shows a plasmid map of the maize expression vector containing the chimeric 
glucocorticoid VP16/Heliothis ecdsysone receptor, P MF6GRVP16HEcR; 

Figure 38 shows a plasmid map of the maize expression vector containing the chimeric 
glucocorticoid VPWHeliothis ecdsysone receptor, P MF7GRVP16HEcR; 

Figure 39 shows a graph which shows the effect of RH5992 in Maize AXB 
protoplasts transformed with P MF6GRVP16HEcR (effector) and p221.9GRE6 (reporter)- 
Figure 40 (Sequence ID No. 6) shows the DNA sequence of the hinge and ligand ' 
binding domains of the Spodoptera exigua ecdysone receptor, 

Figure41 (SequenceroNo.7) S howsmeproteinsequenceahgnnientofme^fe 
19R and Spodoptera SEcR Tag clone hinge and ligand binding domains. indicates 
30 conserved amino acid residue. "." indicates a conservative amino acid exchange; 

Figure 42 shows a graph which shows the effect of RH5992 on Tobacco' mesophyll 
protoplasts transformed with P MF7GRHEcR (effector) and either p221.9GRE6 (Horizontal 
strips) or p22 1 . 10GRE6 (vertical strips). 
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Example I - Cloning of the Heliothis Ecdysone Receptor 

A. Probe generation 

5 The rational behind the generation of the probe to isolate Heliothis homologues to 

the steroid/thyroid receptor superfamily members was based on comparing the sequences of 
developmentally regulated steroid/thyroid receptor superfamily members. The sequences 
. available showed a highly conserved motif within the DNA binding domain of the RAR and 
THR (thyroid) receptors. The motifs were used to design degenerate oligonucleotides for 
10 PCR amplification of sequences derived from cDNA template produced from tissue expected 
to express developmentally regulated steroid/thyroid receptor superfamily members (ie. larval 
tissues). 

The sense oligonucleotide is based on the peptide sequence CEGCKGFF which at the 
DNA level yields an oligonucletide with degeneracy of 32 as shown below : 

15 

2nFA5 ' 5 1 TGC GAG GGI TGC AAG GAI TTC TT 3 ' 
T A T A T 

The antisense oligonucleotide is based on the reverse complement nucleotide sequence 
derived from the peptide: 
20 CQECRLKK 

S R 

for which four sets of degenerate oligos were made. Namely: 

2nFA3 ' 5 1 TTC TTI AG I CGG CAC TCT TGG CA 3 • 
25 T A T C A 

ZnFB3 ' 5' TTC TTI AAI CGG CAC TCT TGG CA 3' 
T A T C A 

30 ZnFC3 ' 5 ' TTC TTI AG I CTG CAC TCT TGG CA 3 • 

T A T C A 

ZnFD3 ' 5 ' TTC TTI AAI CTG CAC TCT TGG CA 3 ' 
T A T C A 



35 



The PCR amplification was carried out using a randomly primed cDNA library made 
from mRNA isolated from 4th and 5th instar Heliothis virescens larvae. The amplification 
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was performed using 10« pfus (plaque forming units) in 50mM KC1. 20mM Tris HC1 pH 8 4 
15mM Mgd2, 200mM dNTPs ( an equimolar mixture of dCTP. dATP, dGTP and dTTP) ' 
lOOng of ZnFA5' and ZnF3' mixture. The conditions used in the reaction followed the hot' 
stm protocol whereby the reaction mixture was heated to 94'C for 5 minutes afterwhichl 
5 U of Taq polymerase was added and the reaction allowed to continue for 35 cycles of 93°C 
for 50 seconds. 40'C for 1 minute and 73<>C for 1 minute 30 seconds. The PGR products 
were fractionated on a 2%(w/v) agarose gel and the fragment migrating between 100 and 
200bp markers was isolated and subcloned into the vector pCRU (Invitrogen). The sequence 
of the insert was determined using Sequenase (USB). 
» The resulting sequence was translated and a database search carried out. The search 

recovered sequences matching to the DNA binding domain of the Drosophila ecdysone 

tlT' Z J 7 C "* reCePt ° r reCCpt0r - ^ *" SC " uence ° f - -ert in 

thisplasmid, debated pOUIZnf . is a ecdysone cognate sequence (Figure l)and 

was used to screen a cDNA library in other to isolate the complete open reading frame. 

B. Library sri-P^nin^ 

reolic^^Ti 15 ^ CDNA 4th/5th Heli ° thiS *~ ^ and 
_^hca,e filtermade from the plates. The number of plwla^wasTOaOOO: TheinserT 

fragment of pCRIIZnf was reamplified and 50ng were end labelled using T4 Polynucleotide 
Kinase (as described in Sambrook et al 1990). X 

SDS at ^rT7r Prehybridised » ta » 0-25 Wv) Marvel, 5 X SSPE and 0.1%(w/v) 

mfdeltu H t T S ° 1Uti0n " *° 011615 W3S ten rcp,aCCd *** —*» -d 
the denatured probe added. The hybridisation was carried out overnight at 42«C after which 

**" " 6 X + °" 1 %(W/V> SDS " 4200 ^wedbyanotherwashaT 
55C. Thefilterwereex P osedtoX-rayfilm(Kodak)for48 hours before processing 

The developed film indicated the presence of one strong positive signal which was 
p^uepunfiedandfunhercharacterised. The lambda ZAP II phage was in vivo excised (see 
Stratagene Manual) and the sequence determined of the resulting plasmid DNA. The clone 
known as pSK19R (or 19R) contained a 1.933kb cDNA fragment with an open readin Le 
of 467 ammo acids (Figure 2). P SK19R was deposited with the NCIMB on 20 June 1995 and 
has been accorded the deposit No NCIMB 40743. 

^ ^is of pSK19R revealed that a 340 bp EcoRI fragment mapping at the 5' 
end of pSK19R has strong and significant similarities to a Drosophila cDNA encoding 
gIyceraldehyde-3-phosphate dehydrogenase. In order to isolate the correct 5'end sequence 
belongmgto^ Aif , the random primed library was re-screened using a probc containing 
^eSendofthepSK^Rbelongingto^/^ ecdysone receptor. The probe was made by 
^ * C SCnSe ^gonucleotide HecRH3C (5' aattaagcttceaccatgccgttaccaatgccaccgaca 
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3') and antisense oligonucleotide HecrNdel (5' cttcaaccgacactcctgac 3') . The PCR was 
carried out as described by Hirst et al., 1992) where the amount of radioisotope used in the 
labelling was 50uCi of a 32 P-dCTP and the PCR was cycled for 1 minute at 94°C, 1 minute at 
60°C and 1 minute at 72°C for 19 cycles. The resulting 353bp radio labelled DNA fragment 
5 was denatured and added to prehybridised filters as described for the isolation of pSK19R. 
The library filters were made from 15 plates each containing 50000 pfus. The library filters 
were hybridised at 65°C and washed in 3XSSPE + 0.1%SDS at 65°C twice for 30 minutes 
each. The filters were further washed with 1XSSPE + 0.1%SDS for 30 minutes and exposed 
to X-ray film (Kodak) overnight. The film was developed and 16 putative positive plaques 

10 were picked. The plaques were re-plated and hybridised under the exact same conditions as 
the primary screen resulting in only one strong positive. The strong positive was consistently 
recognised by the probe and was plaque purified and in vivo excised. The resulting plasmid 
pSK16.1 was sequenced (Seq 1D3) which revealed that the 5' end of the clone extended by 
205 bp and at the 3' end by 653 bp and resulting in a DNA insert of 2.5 kb. Conceptual 

15 translation of the 205 bp yielded 73 amino acids with high similarity to the Drosophila , Aedes 
aegipti, Manduca and Bombyx sequences of the ecdsysone receptor B 1 isoform. However, 
the whole of the 5' end sequence is not complete since a Methionine start site was not found 
with a stop codon in frame 5' of the methionine. In order to isolate the remainder of the 5* 
end coding sequences a 5' RACE protocol (Rapid Amplification of cDNA Ends) was carried 

20 out using the BRL-GIBCO 5* RACE Kit. Two types of cDNA were synthesized where the 
first one used a specific oligonucleotide : 
16PCR2A5' cagctccaggccgccgatctcg3' 

and the second type used random hexamers (oligonucleotide containing 6 random 
nucleotides). Each cDNA was PCR amplified using the oligonucleotides anchor primer : 

25 BRL-GIBCO 5* cuacuacuacuaggccacgcgtcgactagtacgggiigggiigggiig 3* 

and 16PCR2A and cycled for 1 minute at 94°C, 1 minute at 60°C and 1 minute at 72°C for 35 
cycles. The reaction conditions were 20mM Tris-HCl (pH8.4), 50mM KC1, 1.5mM MgCl 2 , 
400nM of each anchor and 16PCR2A primers, 200mM dNTPs (dATP.dCTP.dGTP and 
dTTP) and 0.02 U/ml Taq DNA polymerase. Dilutions of 1:50 of the first PCR reactions 

30 were made and 1ml was use in a second PCR with oligonucleotides UAP : 
(Universal Amplification Primer 5' caucaucaucauggccacgcgtcgactagtac 3') 
and 16RACE2 : 

(5* acgtcacctcagacgagctctccattc 3')- 

The conditions and cycling were the same as those followed for the first PCR. 
35 Samples of each PCR were run and a Southern blot carried out which was probed with a 5* 
specific primer : 

(16PCR1 5* cgctggtataacaacggaccattc 3')- 
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^ Hiis primer is specific for the 5' most sequence of pSK16.1 and was hybridised at 
55 C using the standard hybridisation buffer. The filter was washed at 55°C 3 times in 
3XSSPE + 0. 1 %SDS and exposed to X-ray film for up to 6 hours. The developed film 
revealed bands recognised by the oligonucleotide migrating at lOObp and SOObp (relative to 
5 the markers). A sample of the PCR reaction (4 in total) was cloned into the pCRU vector in 
the TA cloning kit (Invitrogen). Analysis of 15 clones from 4 independent PCRs yielded 
sequence upsteam of pSK16.1 (Figure 4). 

Translation of me ORF results in a 575 an™ ^ 
DNA and hgand binding domains when compared to the ecdysone receptor sequences of 
Drosophtla , Aedes aegypti, Chironomus tentans, Manduca sexta and Bombyx mori (Figure 
5). Interestingly, the N-tenninal end of thcHeliothis sequence has an in frame methinonine 
start which is 20 amino acids longer that that reported for Drosophila , Aedes aegypti and 
Manduca sexta. However, the extended N-terminal end in the Heliothis EcR does not have 
similarity to that of B 0 m*y*m 0 „-. Finally, the C-terminal end of the different Bl isofonn 
«5 ecdysone receptor sequences diverge and do not have significant similarity. 

C. Norther n Blot Analy cic 

-Thesequence-identified by screening the library is expected tobe expiessed"in"tissues" " 

undergoing developmental changes, thus mRNA from different developmental stages of H 
20 virescens were was isolated and a northen blot produced. The mRNAs were isolated from ' 
eggs, 1st, 2nd, 3rd, 4th and 5th instar larvae, pupae and adults. The northern blot was 
hybridised with a Ndel/Xhol DNA fragment from pSKIPR encompassing the 3'end of the 
DNA bmding domain through to the end of the ligand binding domain. The hybridisation 
was carried out in l%(w/v)Marvel, 5X SSPE, 0.1%(w/v) SDS at 65'C for 18 to 24 hours 
The filters were washed in 3X SSPE + 0.1%(w/v) SDS and IX SSPE + 0.1%(w/v) SDS at 
65 C. The filter was blotted dry and exposed for one to seven days. The gene recognises two 
transcripts (6.0 and 6.5 kb) which appear to be expressed in all stages examined, however the 
levels of expression differ in different stages. It should be noted that the same two transcripts 
are recognised by probes specific to the DNA binding domain and the ligand binding domain 
indicating that the two transcripts arise from the same gene either by alternative splicing or ' 
alternative use of polyadenylation sites. 

In summary, adult and 5th instar larvae have lower levels of expression while all other 
tissues have subtantial levels of expression. 



25 
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Example EE Expression of Heliothis ecdysone receptor in Mammalian cells 

To demonstrate that the cDNA encodes a functional ecdysone receptor, effector 
5 constructs were generated containing the HEcR under the control of the CMV 
(cytomegalovirus) promoter, and the DN A expressed in mammalian cells. 
Effector constructs 

A first mammalian expression plasmid was constructed by placing a HindlQ/Notl 
pSK19R fragment into the pcDNA3 HindDI/Notl vector resulting in pcDNA3 19R (Figure 7). 
10 A second effector plasmid was constructed wherein the non-coding region of the 

cDNA 19R was deleted and a consensus Kozak sequence introduced. The mutagenesis was 
carried out by PCR amplifying a DNA fragment with the oligo HecRH3C : 

5'aattaagcttccaccatgccgttaccaatgccaccgaca 3' 
containing a unique HindlQ restriction enzyme recognition site followed by the mammalian 
15 Kozak consensus sequence, and HecRNdel : 
5'cttcaaccgacactcctgac 3*. 

The resulting 353bp PCR fragment was restriction enzyme digested with Hindm and 
Ndel. gel purified and ligated with 19R Ndel/NotI fragment into a pcDNA3 Hindm/Notl 
vector resulting in pcDNA3HecR. 

20 A third effector construct was made with the 5' end sequences of pSK16. 1 by PCR. 

The PCR approach involved PCR amplifying the 5 T end sequences using a 5* oligonuclotide 
containing a Hindm restriction cloning site, the Kozak consensus sequence followed by 
nucleotide sequence encoding for a Methionine start and two Arginines to be added to the 5" 
end of the amplified fragment : 

25 (1 6H3K 5' attaagcttgccgccatgcgccgacgctggtataacaacggaccattc 3 '), 

the 3' oligonucleotide used was HecrNdel. The resulting fragment was restriction enzyme 
digested, gel purified and subcloned with an Ndel/NotI 19R fragment into pcDNA3 
Ndel/NotI vector. The plasmid was named pcDNA3H3KHEcR. 

A fourth effector construct was produced which contains the extended N-terminal end 

30 sequence obtained from the 5* RACE experiment. Thus, a PCR approach was followed to 
introduce the new 5' end sequences in addition to a consensus Kozak sequence and a Hindm 
unique cloning sequence. The sense oligonucleotide used was RACEH3K : 

5* attaagcttgccgccatgtccctcggcgctcgtggatac 3\ 
while the antisense primer was the same as that used before (HecrNdel). The cloning strategy 

35 was the same as used for the pcDN A3H3 KHEcR to give rise to pcDNA3RACEH3KHEcR. 
The PCR mutagenesis reactions were carried out in the same manner for all 
constructs. The PCR conditions used were 1 minute at 94°C, 1 minute at 60°C and 1 minute 
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at 72*C for 15 cycles. The reactions conditions were 50mM Tris-HCl (dH8 4) 25 m M icm 

»£. <™ ju„. dcrp, < 01P ^ ^ ^ of ^ Ka - 

iSU/Rau™* of T<u, DNA pel y™^. R,,^ cmsmct „ ^ , 

Report er constrarf 

The reporter plasmid to be co-transfected with the expression vector contained 4 

Recr3A 

gcacttgtccaatctgcagagacaagggttcaatgcacttgtccaatata- gacaagggttcaat 
Recr3B 



^"gtctaagcttattggacaagtgcattgaacccttgtcta-3'. " ~ " " - - 8 - 6 - - WC ^ g 

The resulting 135bp DNA fragment was ligated to the vector pSWBGAL Soel/CM 

Crrhorr"^, thePrcSenCe0fligand - ""-^—reliesuponthepreselc ^ 

Mammalian franefqetinn mgth nrf. 

25 k "^ sfections of mammalian cell lines (CHO-K1 Chinese hamster ovary)- ATCC 

«CL61 or cos-1 (Monkey -^I^^p^^^ J^,^ 

^.(1985) IRL Press, Oxford ) or using LipofectAMINE (Gibco BRL Cat. No 18324-012 

resulted in a 2 3 fold TVT^ * **" Muristerone A or RH5992 

„7" I ?! 93 06118 W6rc USedsincethe yare known to have constitutive levels of 
MunsteroneA (Yao., 1993). Moreover, to further investigate the need for RXR 
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interactions, a ccRXR was co-transfected into HEK293 cells (along with the effector and 
reporter) resulting in a 9 fold induction of reporter activity compared to the untreated cells 
(Figure 9). The co-transfection of ocRXR with reporter and effector increased by four fold the 
reporter activity compared to cells transfected with effector and reporter alone. Induction was 

5 observed both in the presence of either Muristerone A or RH5992. These data clearly 

demonstrate that the cDNA HEcR encodes a functional ecdysone receptor. Moreover, The 
ability of HEcR to complex with ocRXR and bind Muristerone A or RH5992 provide evidence 
for the usage of the entire HEcR as a component of a mammalian gene switch. In particular, 
it offers the advantage of reducing uninduced expression of target gene since ecdysone 

10 receptor and response elements are not present in mammalian cells. 

Example III - Chimeric constructs and ligand validation in Maize Protoplasts 

In order to apply the ecdysone receptor as an inducible system it was deemed necesary 
15 to simplify the requirements of the system by avoiding the need of a heterodimer formation to 
obtain an active complex. The glucocorticoid receptor is known to form homodimers and 
chimeric constructs of the glucocorticoid receptor transactivating and DNA binding domains 
fused to the ecdysone receptor hinge and ligand binding domains have been shown to be 
active as homodimers in mammalian cells in the presence of Muristerone A (an ecdysone 
20 agonist)(Christopherson et al., 1992). However, the chimeric receptor is not responsive to 
20-hydroxyecdysone (Christopherson et al., 1992). 

The analysis of the activation of the glucocorticoid/i/c/zof/iw ecdysone chimeric 
receptor entailed the production of two other control effector constructs. The first one of the 
constructs contained the intact glucocorticoid receptor while the second one contained a 
25 glucocorticoid/Dro5op/w7fl ecdysone chimeric receptor. 
Effector constructs 

(i) Glucocorticoid receptor Maize expression construct 

The glucocorticoid receptor DNA for the Maize transient expression construct was 
produced via the polymerase chain reaction (PCR) of Human Fibrosarcoma cDNA (HT1080 

30 cell line, ATCC#CC1121) library (Clontech)(see Hollenberg et aL, 1985). The PCR approach 
taken was to amplify the 2.7kb fragment encoding the glucocorticoid receptor in two 
segments. The first segment entails the N-terrninal end up to and including the DNA binding 
domain while the second fragment begins with the hinge region (amino acid 500) thought to 
the end of the reading frame. Thus, the PCR primer for the N-terminal end segment was 

35 designed to contain an EcoRI site and the Kozak consensus sequence for translation initiation 

GREcoRI 5'attgaattccaccatggactccaaagaatcattaactc 3'. 
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™ , ^ 3 ' Cnd Phmer C ° ntainS a ^ Site 111 ^ with frame at amino acid 

500 of the published sequence : 

GRXhoI 5' gagactcctgtagtggcctcjagcattccttttatttttttc 3'. 

The second fragment of the glucocorticoid receptor was produced with a 5' end 
« ohgonucleotide containing an Xhol site in frame with the open reading frame at the begining 
or the hinge region (amino acid 500) : 
GRHinge 5" attcjcgagattcagcaggccactacaggag 3" 

while the 3' end oligonucleotide contained an EcoRI site 400 bp after the stop codon : 
GRStop 5' attgaattsaatgctatcgtaactatacaggg 3'. 

The glucocorticoid receptor PCR was carried out using Vent polymerase (Biolabs) 
under hot start conditions followed by 15 cycles of denaturing (94'C for 1 minute), annealing 
(66 C for 1 minute) and DNA synthesis (72°C for 3 minute). The template was produced by 
makmg first strand cDNA as described in the TA cloning kit (Invitrogen) after which the PCR 

O ZT?\ " ^ KC1 ' 10mM (NH4)2S ° 4 ' 20mM TRIS - Ha P H 8 " 8 . 2 MgSO. 
0.1% (v/v) Tnton X-100, 200 mM dNTPs, lOOng of each Primer and 2 U of Vent 

Polymerase. The PCR products was restriction enzyme digested with EcoRI and Xhol and 

subc oned into pBluescript SK ( P SK) EcoRI. The resulting plasmid pSKHGI was sequenced 

the PCR pnmers) (Hollenberg et al., 1985). 

• 2 ' ?kb EC0RI fragmCnt WaS subcloned i*° »c vector pMF6PAT EcoRI resulting 

m pMF6HGIPAT (Figure 10). 

receptor 6 CXPreSSi ° n a Glucocorticoid/ Drosophila ecdysone chimeric 

Q ^ ™ C glucocorticoid receptor portion of the chimeric receptor was isolated from 
pSKHGI by producing a 1.5kb BamHI/XhoI restriction fragment containing the N-terminal 
end up to and including the DNA binding domain. 

The Drosophila ecdysone receptor portion was isolated through PCR of first stand 
cDNA prepared from Drosophila adult mRNA. The PCR was carried out using a 5' 
ohgonucleotide containing a Sail site (ie. Drosophila ecdysone receptor contains a Xhol site 
at the end of the ligand binding domain) which starts at the begining of the hinge region • 
aimno acid 330, Ecr8 attasgacaacggccggaatggctcgtcccggag 3'. 

The 3' end oligonucleotide contains an BamHI site adjacent to the stop codon • 
EcRstop 5' tcgggctttgttaggatcctaagccgtggtcgaatgctccgacttaac 3'. 

^ The PCR was carried out under the conditions described for the amplification of the 
Glucocorticoid receptor and yielded a 1.6 kb fragment. The fragment was introduced into 
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pSK Sall/BamHI and the sequence determined and compared to the published one (Koelle et 



The maize transient expression plasmid was produced by introducing into pMF6 
BamHI vector the 1.5kb BamHI/XhoI glucocorticoid receptor fragment and the 1.6kb 
Sall/BamHI Drosophila receptor portion to yield the chimeric plasmid pMF6GREcRS 
(Figure 9). 

(iii) Construction of the GlxxcocomcoidlHeliothis ecdysone chimeric receptor Maize transient 
expression plasmid. 

The Glucocorticoid receptor portion of the chimera was produced as describe in 
Example II(ii). The production of the Heliothis ecdysone receptor portion involves the 
introduction of a Sail recognition site at the DNA binding/hinge domain junction (amino acid 
229). The addition of the Sail site : 
Hecrsal 5'attgtcgacaaaggcccgagtgcgtggtgccggag 3' 

was achieved via PCR mutagenesis making use of an unique AccI site 107bp downstream of 
the juction point (or 1007 bp relative to Seq ID No 4): 
Hecracc 5' tcacattgcatgatgggaggcatg 3*. 

The PCR was carried out using Taq polymerase (2.5 U) in a reaction buffer containing 
lOOng of template DNA (pSK19R), lOOng of Hecrsal and Hecracc, 20 mM TRIS-HC1 pH 
8.4, 50mM KC1, lOmM MgCl 2 , 200mM dNTPs. The reaction was carried out with an initial 
denaturation of 3 minutes followed by 15 cycles of denaturation (1 minute at 94°Q, annealing 
(1 minute at 60°C) and DNA synthesis ( 1 minute at 72°C). The DNA was restriction enzyme 
digested and subcloned into pSK Sall/SacI with the 1.2kb AccI/SacI 3' end HecR fragment to 
yield pSK HeCRDEF (or containing the hinge and ligand binding domains of the Heliothis 
ecdysone receptor). The construction of the maize transient expression plasmid containing 
the Q\ucocot\icoidJHeliothis ecdysone chimeric receptor involved the ligation of pMF6 
EcoRI/SacI with the l^kb EcoRI/XhoI fragment of Glucocorticoid receptor N-terminal end 
and the 1.2 kb Sall/SacI fragment of pSk HEcRDEF to yield pMF6GRHEcR (Figure 10). 
Reporter plasmids 

Two reporter plasmids were made by inserting the into p221.9 or p221.10 
BamHI/Hindin vectors two pairs or oligonucleotides containing six copies of the 
glucocorticoid response element (GRE). The two sets of oligonucleotides were designed 
with restriction enzyme recognition sites so as to ensure insertion of the two pairs in the right 
orientation. The first oligonucleotide pair GRE1A/B is 82 nucletides long and when annealed 
result in a DNA fragment flanked with a Hindm site at the 5' end and a Sail site at the 3* end : 
GRE1A 

S'agcttcgactgtacaggatgttctagctactcgagtagctagaacatcctgtacagtcgagmgctagaacatcct 3* 



al.,1991). 



BNSDOCID: <WO 9637609A1 _l_> 



WO 96/37609 j _ i 

PCT/GB96/01195 

-26- W 



GREIB 



15 



20 



25 



30 



35 



S^tcgactgtacaggatgttctagctactcgac^^ cagtcga y 

The second pair of oligonucleotides is flanked by a Sail site at the 5' end and a Bamffl 
site at the the 3* end 

* GRE2A 5' tcgactagctagaacatcctgtacagtcgagtagctagaacatcctgt 
acagtcgagtagctagaacatcctgtacag 3* 

GRE2B5-gatcctg te caggatgttcugctac«cgactgtacaggatgttctagctactcgactg^ 

The resulting plasmids were named p221.9GRE6 (Figure 13) and p221 10GRE6 
(F.gure 14)(used in later Example). The difference between P 221.9 and p221.10 plasmids is 
that P 221.9 contains the -60 35SCaMV minimal promoter while P 221.10 (p221 10GRE6) 
contains the -46 35SCaMV minimal promotor. 
Method 

Protoplasts were isolated from a maize suspension culture derived from BE70 x A188 
embryogenic callus material, which was maintained by subculturing twice weekly in MSO.5^ 
CMS medium supplemented with 3% sucrose, 690mg/l proline, lg/1 myo-inositol, 0.2g/l casein 
acid hydrolysate, 0.5mg/l 2,4-D, pH5.6). Cells from suspensions two days post subculture 

Zi r'r^T*' {2 ° % CeUU,aSC M ' °- 2% Y23. 0.5M Mannitol, 

- -5mM CaCl 2 2H 2 0. 0.-5% MES, pH5.6, -660mmol/kg) using -lOmJ/g eelisjneubating ~ 
dim hght routing gently for - 2 hours . ^ ^ ^ • 

250um and 38pm sieves, and the filtrate centrifuged at 700rpm for 3.5 minutes, discarding the 
supernatant. The protoplasts were resuspended in wash buffer (0.358M Kd 1 OmM 
NH^O, 5 .0mM CaCl 2 2H 2 0, 0.5mM KH 2 P0 4 , P H4.8, ~670mmol/k g ) and pelleted as before. 
This washing step was repeated. The pellet was resuspended in wash buffer and the 
protoplasts were counted. Transformation was achieved using a Polyethylene glycol method 
based on Negrutiu et.al. Protoplasts were resuspended at 2 x 10</ml in MaMg medium (0 4M 
MannitoU5mM MgCl, 0. 1% MES, pH5.6. ^Ommol/kg) aHquotting 0.5ml / treatment (i.e. 
1x10 protoplasts / treatment). Samples were heat shocked at 45°C for 5 minutes then cooled 
to room temperature. lOug each of p221.9GRE6 and P MF6HR1PAT (GR) (lmg/ml) / 

TT™ZT 31,(1 m genUy ' f ° U0Wed ^ ****** Edition of 0.5ml warm 
(-45 Q PEG solution (40% PEG 3.350MW in 0.4M Mannitol, 0.1M Ca^O,),, P H8.0) 
which was mixed in thoroughly but gently. Treatments were incubated at room temperature 
for 20-25 minutes, then 5ml 0.292M KC1 ( P H5.6. ~530mmol/kg) was added step-wise. 1ml at 
a time. w,th mixing. The treatments were incubated for a further 10-15 minutes prior to 
pelleting the protoplasts by centrifuging as before. Each protoplast treatment was 

l 5al ^ mediUm <*» medium ' 2% sucrose - « 2,4-D, 9% Mannitol, 
PH5.6, -700mmoWcg) +/- 0.0001M dexamethasone (glucocorticoid). The samples were 
incubated in 3cm dishes at 25°C, dark, for 2^48 hours prior to harvesting. Fluorometric 
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assays for GUS activity were performed with the substrate 4-methylumbeiliferyl-D- 
glucuronide using a Perkin-Elmer LS-35 fluorometer (Jefferson et al., 1987). Protein 
concentration of tissue homogenates were determined by the Bio-Rad protein assay 
(Bradford, 1976). The method was repeated for each effector construct. 
5 Results 

Reporter gene assay 

A reporter gene construct (p221.9GRE6) was generated containing the GUS reporter 
gene under the control of a -60 CaMV 35S promoter with 6 copies of the glucocorticoid 
response element. To test this construct was functional in maize protoplasts a co- 

10 transformation assay was performed with the reporter construct p22 1.9GRE6 and the effector 
construct pMF6HR!PAT (GR) construct containing the entire glucorticoid receptor. 

Figure 15 shows that Reporter p221.9GRE6 alone or reporter plus effector 
pMF6HRlPAT (GR) with no activating chemical gave no significant expression. When 
reporter plus effector were co-transformed into maize protoplasts in the presence of 

15 0.0001M dexamethasone (glucocorticoid), a significant elevation of marker gene activity was 
observed (Figure 15). The response is specific to glucorticoid as the steroid Muristerone A 
does not lead to induced levels of expression. These studies clearly show the reporter gene 
construct p221.9GRE6 is capable of monitoring effector /ligand mediated gene expression. 
Chimeric ecdysone effector constructs mediate inducible expression in maize transient 

20 protoplasts assays 

A chimeric effector plasmid pMF6GREcRS was constructed, containing the ligand 
binding domain from the Drosophila ecdysone receptor and the DNA binding and 
transactivation domain from the glucorticoid receptor. To confirm the reporter gene 
construct p221.9GRE6 could respond to a chimeric ecdysone effector construct, a series of 

25 co-transformauon into maize protoplasts was performed. 

Figure 16 shows that reporter (p221.9GRE6) alone or reporter plus effector 
(pMF6GREcRS) with no activating chemical, gave no significant expression in maize 
protoplasts. When reporter plus effector were co-transformed into maize protoplasts in the 
presence of lOOuM Muristerone A, a significant elevation of marker gene activity was 

30 observed. The response was specific to Muristerone A, as the steroid dexamethasone did not 
lead to induced levels of expression. These studies clearly showed the reporter gene 
construct p221.9GRE6 is capable of monitoring chimeric ecdysone effector /ligand mediated 
gene expression. 

A second chimeric effector construct pMF6GRHEcR, was generated containing the 
35 ligand binding domain from Heliothis ecdysone receptor. When co-transformed into maize 
protoplasts with the reporter plasmid p221 .9GRE6, no response to lOOuM Muristerone or 
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lOOuM dexamethasone was observed (Figure 17). These data clearly show the Drosophila 
and Hehoihis ligand binding domains exhibit different properties. 

When the effector plasmid P MF6GREcRS, containing the ligand binding domain from 
Drosophxla , was tested with the reporter P 221.9GRE6 in presence of the non-steroidal 
5 ecdysone agonists RH5849 and RH5992 (mimic), no chemical induced reporter gene activity 
was observed (Figures 18 and 19). «.uv«y 

When the effector plasmid pMFoGRHEcR, containing the ligand binding domain from 
HeUothu , was tested with the reporter P 221.9GRE6 in presence of the non-steroidal 
ecdysoneagonists RH5992 (mimic), significant chemical induced reporter gene activity was 
' observed (Figure ; 20). These data demonstrate the ligand binding domain from HelioZsZ 
different properties to the Drosophila receptor in that the former responded to the non- 
s^dal ecdysteroid agonist RH5992. Figure 21 demonstrates the effector plasmid 
PMF6GRHECR confers RH5992 dependant inducibility on the reporter p221.9GRE6 in a 
dose respons, V e manner. Induction was observed in a range from luM-lOOuM RH5992. 

Example IV - Testing of effector vectors in Tobacco protoplasts 

" ^7 R H59 1 Q?r Peri T entS me-spe^if7c effect " " 

of RH5992 (nunuc) on pMF6GRHEcR in maize protoplasts. It is the aim in this example to 
show the generic application to plants of the glucocorticoid/^/^ ecdysone chimeric 
receptor switch system. Tobacco shoot cultures cv. Samsun, were maintained on solidified 
MS medmm + % sucrose in a conned envmonment room (16 hour day / 8 hour night at 

1 >Z 1 \ J WCrc " *" S0UrCe material fOT Prot ° plasts - were ^ed Parallel 

to the mid-nb. discarding any large veins and the slices were placed in CPW13M 13% 

Manmtol, pH5.6, -860mmol^g) for -1 hour to pre-plasmolyse the cells. This solution was 

T^tZ^ e Z me ^ m% CeUUlaSe R1 °- 0 05% Macero ^ RIO in CPW9M 
(CPW13M but 9% Mannitol), pH5.6, -600mmoWcg) and incubated in the dark at 25'C 
overnight (-16 hours). Following digestion, the tissue was teased apart with forceps and any 
large undigested pieces were discarded. The enzyme mixture was passed through a 75um 
sieve and the filtrate was centrifuged at 600rpm for 3.5 minutes, discarding the supernatant 
The pellet was resuspended in 0.6M sucrose solution and centrifuged at 600rpm for 10 

wTSw^m^ ,ayCr ° f Pr ° t0PlaStS rem ° Ved U&mS 3 ^ and diluted 
^CPW9M (PH5.6, -560mmo^g). The protoplasts were again pelleted by centrifuging at 
eOQrpm for 3.5 minutes, resuspended in CPW9M and counted. A modified version of the 
FEG-mechated transformation above was carried out Protoplasts were resuspended at 
2x10 /ml in MaMg medium and aliquotted using 200ul / treatment (i.e. 4x10 s protoplasts / 
treatment). 20ug each of P MF6GRHEcRS and P 221.9GRE6 DNA (Img/ml) were added 
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followed by 200ul PEG solution and the solutions gently mixed. The protoplasts were left to 
incubate at room temperature for 10 minutes before addition of 5ml MSP19M medium (MS 
medium, 3% sucrose, 9% Mannitol, 2mg/l NAA, 0.5mg/l BAP, pH5.6, -700mmol/kg) +/- 10 
uM RH5992. Following gentle mixing, the protoplasts were cultured in their tubes, lying 
5 horizontally at 25°C, light. The protoplasts were harvested for the GUS assay after -24 hours. 
Effector construct 

(i) Construction of a Dicotyledonous expression vector 

The vector produced is a derivative of pMF6. pMF6GREcRS was restriction enzyme 
digested with PstI to produce 3 fragments namely, 3.4{Adh Intronless pMF6), 3.2(GREcRS) 

10 and 0.5(Adh intron I) kb). Isolation and religation of the 3.4 and 3.2 kb fragments resulted in 
pMF7GREcRS (Figure 22). pMF7GREcRS was restriction enzyme digested with EcoRI/SacI 
resulting in the 3.4kb pMF7 EcoRI/SacI vector which when isolated and purified was ligated 
to a 1.5 kb EcoRI/XhoI N-terminal end of the glucocorticoid receptor and the 1.2 kb 
S all/Sac I Heliothis ecdysone C-terminal end sequences to produce pMF7GRHEcR (Figure 

15 23). 

Reporter plasmid 

The reporter plasmids constructed for the maize transient experiments were the same 
as those used without alteration in the tobacco leaf protoplast transient expression 
experiments. 

20 Results - Chimeric ecdysone effector constructs mediate inducible expression in tobacco 
transient protoplast assays 

Experiments were performed to demonstrate that the effector plasmid pMF6GRHEcR 
can confer chemical dependant inducible expression on the reporter p221.9GRE6 in tobacco 
mesophyll protoplasts. 

25 Figure 24 shows that reporter (p22 1 .9GRE6) alone or reporter plus effector 

(pMF7GRHEcR) with no activating chemical, gave no significant expression in tobacco 
protoplasts. When reporter plus effector were co-transformed into tobacco protoplasts in the 
presence of lOuM RH5992, a significant elevation of marker gene activity was observed. 
These data show a chimeric ecdysone effector construct, containing the Heliothis ligand 

30 binding domain can confer non-steroidal ecdysteroid dependant expression on reporter gene 
constructs in both monocotyledonous and dicotyledonous species. 
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Example V - Chimeric activity in Mammalian cells 

Effector construct; 

5 (i) Construction of Glucocorticoid/ffWwrtw ecdysone chimeric receptor. 

The mammalian expression vector used in this experiment was pcDNA3 (Invitrogen) 
The GRHEcR 2.7kb BamHI DNA fragment (isolated from pMF6GRHEcR) was introduced ' 
into the pcDNA3 BamHI vector. The recombinants were oriented by restriction enzyme 
mapping. The DNA sequence of the junctions was determined to ensure correct orientation 
1 0 and insertion (pcDNA3GRHEcR, Figure 25). 

Reporter rnnctm/.| 

The reporter plasmid for mammalian cell system was produced by taking pSWBGAL 
plasmid and replacing the CRESW Spel/Clal fragment for a synthetic 105 bp DNA fragment 
containing 4 copies of the glucocorticoid response element (GRE) and flanked by Spel at the 
15 5' end and Afin at the 3' end. 

The oligonucleotides were synthesised using the sequences • 
GREspel 

20 GREafi2 

g^SgtS 

The two oligonucleotides were purified annealed and ligated to pSWBGAL Spel/Afin 
to produce pSWGRE4 (Figure 26). 

25 R «ults - Chimrrir HEcR drives transient rennrt^ r n» in mamtn „, ian ^ 

No expression was detected when a reporter gene construct pS WGRE4, comprising 
of a minimal p-globin promoter containing four copies of the glucocorticoid response 
element, fused to a p-galactosidase reporter gene, was introduced into CHO cells. Similarly 
no expression was detected when pS WGRE4 and an effector plasmid pCDNA3GRHEcR 

30 containing the transactivation and DNA binding domain from the glucocorticoid receptor'and 
the hgand bonding domain from the Heliothis ecdysone receptor, under the control of the 
CMV promoter were co-transformed into CHO-K1 or HEK293 cells. When co-transformed 
CHO (Figure 27) and HEK293 cells (Figure 28) were incubated in the presence of the non- 
steroidal ecdysone agonists RH5992 (rnimic), significant chemical induced reporter gene 

•5 actmty was observed. Equally, induction of reporter activity was observed when HEK293 
cells transfected with pcDNA3GRHEcR and reporter were treated with Muristerone A 
(Figure 28). 
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Example VI - Screening system allows new chemical activators and modified ligand 
binding domains to be tested in Mammalian cells 

5 The basis of a screening system are in place after the demonstration that the chimeric 

receptor was activated in the presence of RH5992. A screen was carried out using CHO cells 
transiently transfected with both pSWGRE4 (reporter) and pcDNA3GRHEcR (effector) 
constructs. In the first instance 20 derivatives compounds of RH5992 were screened. It was 
observed that 7 out of the 20 compounds gave an increased reporter gene activity compared 
10 to untreated cells. A second screen was carried out in which 1000 randomly selected 

compounds were applied to transiently transfected CHO ceils. Two compounds were found to 
activate reporter gene activity above that from the untreated controls. The second screen 
suggest that this cell based assay is a robust and rapid way to screen a small library of 
compounds, where a thousand compounds can be put through per week. 

15 

Example V - Stably transformed Tobacco plants 

Stable Tobacco vectors 

The components of the stable Tobacco vectors were put together in pBluescript prior 
20 to transfer into the binary vector. The production of stable transformed plants entails the 
production of a vector in which both components of the switch system (ie. effector and 
reporter) are placed in the same construct to then introduce into plants. 

The methodology described below was used to produce four different stable Tobacco 
vectors. The method involves three steps: 

25 

1 . pBluescript SK Hindm/EcoRI vector was ligated to either GRE6- 
4635SCaMVGUSNOS Hindm/EcoRI (from p221.10GRE6) or GRE6- 
6035SCaMVGUSNOS Hindm/EcoRI (from p221.9GRE6) resulting in plasmid pSK^6 and 
pSK-60. 

30 

2. This step involves the addition of the chimeric receptor (35SGRHEcRNOS or 
35SGRVP16HEcRNOS) to pSK-60 or pSK-46. Thus a pSK-60 (or pSK-46) Xbal vector 
was ligated with either the 3.4kb 35SGRHEcRNOS Xbal or the 3.0kb 
35SGRVP16HEcRNOS Xbal DNA fragment to produce pSKESl (pSKGRE6- 

35 6035SCaMVGUSNOS-35SGRHEcRNOS), pSKES2 (pSKGRE6-4635SCaMVGUSNOS- 
35SGRHEcRNOS), pSKES3 (pSKGRE6-6035SCaMVGUSNOS-35SGRVP16HEcRNOS) 
and pSKES4 (pSKGRE6-4635SCaMVGUSNOS-35SGRVP16HEcRNOS). 
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t I™*? fr ° m PBIUCSCriPt baSCd VCCt0rS 10 bmar y Vectors ' ™* t»»fcr of the ES 1 
29) ES2 (Figure 30), ES3 (Figure 31) or ES4 (Figure 32) DNA fragments into *e 
binaiy vector JR1 involves five steps: 

5 

uLt e ? h eSttiCti T nZyme digeSti ° D ° fpSKBS1 ^ ES3 ' and ES4 > wi * Apal and Nod to 
liberate the insert from the vector pBluescript. 

2? « ^ DNA fragmentS Werc BamHI methylated for 2 hours at 3TC in TRis Hn 

MgCl, SOuM SAM (S-adenosylmetmonine) and 20 U of BamHI methyl 

(legate a Apal/NotI hnker onto the fragment. The linker was designed to have an interna, 

ApaBNotl 5' cattggatccttagc 3' and 
ApaBNot2 5'ggccgctaaggatccaatgggcc 3'. 

(iv) Restriction enzyme digest the protected and linkered fragment with BamHI and 
fractionate the products on a l%(w,v) Agarose gel. The protected DNA fragmT(5 5kb ) 
was cut out of the gel and purified. "agment (5.5M» 

£Ll ^° n r^ BamHI VCCt0r WiUl Pr0tCCted band W3S Carried « <° P^uce 
«*H (JRIES2 JRJES3 or JRIES4). The DNA of the recombinant was characterised by 
restriction mappmg and the sequence of the junctions determined. 

The plant transformation construct P ES1, containing a chimeric ecdysone receptor and 
a reporter gene cassette, was transferred into Agrobacterium tumefaciens lL^u^ 

Z Z ^ H ° ,SterS Ct * (1978) - ™~ <Nico«ana ^ » - 

Samsun t^sformants were produced by the leaf disc method (Bevan, 1984). ShootsTere 

generated on medium containing lOOmg/1 kanamycin. After rootmg, p,anu K weT 
^sferred to the glasshouse and grown under 1 6 hour light/ 8 hour Lk conditionT 

Transgenic tobacco plants were treated in cell culture by adding lOOuM RH5992 to 

t^* t T~r a *~*™ ^oicaUymthepln^aC o 
RH5992. In further experiments 5mM RH5992 was applied in a foliar application to 8 week 

GvfT°r ^ t0baCC ° ^ * fl - M ^-ed level of 

GUS ^uvity were comparable to a wild type control, while RH5992 levels were significltly 
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Ecdysone switch modulation and optimisation 

Example VIQ - Yeast indicator strains for primary screen of chemical libraries 

5 

A set of yeast indicator strains was produced to use as a primary screen to find 
chemicals which may be used in the gene switch. The properties of the desired chemicals 
should include high affinity resulting in high activation but with different physico-chemical 
characteristics so as to increase the scope of application of the technology. Moreover, the 
10 production of this strain also demonstrates the generic features of this switch system. 
Effector vector 

A base vector for yeast YCpl5Gal-TEV-l 12 was generated containing: 

Backbone - a modified version of pRS315 (Sikorski and Hieter (1989) Genetics 122, 19-27)- 

a shuttle vector with the LEU2 selectable marker for use in yeast; 
15 ADH1 promoter (BamHI- Hind m fragment) and ADH1 terminator (Not I- Bam HI 

fragment) from pADNS (Colicelli et al PNAS 86, 3599-3603); 

DNA binding domain of GAL4 (amino acids 1-147; GALA sequence is Laughon and 

Gesteland 91984) Mol. Cell Biol. 4, 260-267) from pSG424 (Sadowski and Ptashne (1989) 

Nuc. Acids Res. 17, 7539); 
20 Activation domain - an acidic activation region corresponding to amino acids 1-107 of 

activation region Bl 12 obtained from plasmid pBl 12 (Ruden et al (1991) Nature 350, 

250-252). 

The plasmid contains unique Eco RI, Nco I and Xba I sites between the DNA binding 
domain and activation domains. 
25 Into this vector a PCR DNA fragment of the Heliothis ecdysone receptor containing 

the hinge, ligand binding domains and the C-terminal end was inserted. The 5' oligonucleotide 
is flanked by an Ncol restriction recognition site and begins at amino acid 259 : 
HecrNcoI 5* aattccatggtacgacgacagtagacgatcac 3'. 

The 3* oligonucleotide is flanked by an Xbal site and encodes for up to amino acid 

30 571: 

HecRXbal 5' ctgaggtctagagacggtggcgggcggcc 3\ 

The PCR was carried out using vent polymerase with the conditions described in 
Example IA. The fragment was restriction enzyme digested with Ncol and Xbal purified and 
ligated into YCpl5GALTEVl 12 Ncol/Xbal vector to produce YGALHeCRB 1 12 or TEV- 
35 B 1 12 (Figure 34). In order to reduce constitutive activity of the YGALHeCRB 112 plasmid a 
YGALHeCR plasmid was produced in which the B 1 1 2 activator was deleted by restriction 
enzyme digesting YGALHeCRB 1 12 with Xbal/Spel followed by ligation of the resulting 
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vector (ie. Spel and Xbal sites when digested produce compatible ends)(TEV-8 Figure 33) 
An effector plasmid was constructed whereby the B 1 1 2 transactivating domain was excised 
from YGalHecRBl 12 with Xbal and replaced with the VP16 transaction domain DNA 
fragment (encoding amino acids 41 1 and 490 including the stop codon). The resulting vector 
was named YGalHecRVP16 or TEWP16-3 (Figure 35). 
Reporter const ruction far y i»as| 

The S. cerevisiae strain GGY1:: 171 (Gill and Ptashne (1987) Cell 51 121-126) 
YT6::171 (Himmelfarb et al (1990) Cell 63, 1299-1309) both contain reporter plasmids' 
consisting of the GA14-responsive GAL1 promoter driving the E. coli B-galactosidase gene 
These plasmids are integrated at the URA3 locus. The reporter strain YT6::185 contains the 
reporter plasmid pJP185 (two synthetic GAL4 sites driving the B-galactosidase gene) 
integrated at the URA3 locus of YT6 (Himmelfarb et al). (Note- the parental strains YT6 and 
GGY1 have mutations in the GAL4 and GAL80 genes, so the reporter genes are inactive in 
the absence of any plasmids expressing GAL4 fusions). 
15 Yeast assay 

Standard transformation protocols (Lithium acetate procedure) and selection of 
colon.es by growth of cells on selective media (leucine minus medium in the case of the 
YGpl5Gal»TEV-l 12 plasmid)-^described iiTGuthrieand Fink )1991) Guide to YeasT 
Genetics and Molecular Biology: Methods in Enzymology Vol. 194 Academic Press) and the 
reporter gene assay is a modification of that described in Ausabel et al (1993) Current 
Protocols in Molecular Biology (Wiley) Chapter 13). 

Results - Autom ated smr pinft system allows new ch ft mir a l a ^, tors anri mrv1if ^ B 
bindinp domain s to be tested in veast 

An effector vector pYGALHEcRB 1 12 has been generated containing a GAL4 DNA 
bmdrng domain, a B 1 12 activation domain and the ligand binding region from Heliothis 
vtrescens. In combination with a GAL reporter vector, pYGALHEcRB 112 form the basis of 
a rapid, high throughput assay which is cheap to run. This cell-based assay in yeast 
(Saccharomyces cerevisiae) will be used to screen for novel non-steroidal ecdysone agonists 
which may of commercial interest as novel insecticides or potent activators of the ecdysone 
gene switch system. The demonstration of an efficient system to control gene expression in a 
chemical dependant manner, forms the basis of an inducible system for peptide production in 
yeast. 

The yeast screening system forms the basis of a screen for enhanced ligand binding 
using the lac Z reporter gene vector to quantitatively assay the contribution of mutation in the 
ligand bindmg domain. Alternatively, enhanced ligand binding capabilities or with a selection 
cassette where the lac Z reporter is replaced with a selectable marker such as uracil (URA 3) 
tryptophan (TrpI) or leucine (Leu2), and histidine (His). Constructs based on 



20 
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pYGALHEcRBl 12 with alterations in the ligand binding domain are grown under selection 
conditions which impair growth of yeast containing the wild type ligand binding domain. 
Those surviving in the presence of inducer are retested and then sequenced to identify the 
mutation conferring resistance. 

5 

Example IX - Optimisation of chimeric receptor using a strong transactivator 

Construction of mammalian expression plasmid with chimeric receptor containing Herpex 

Simplex VP 16 protein sequences. 
10 The construction of this chimeric receptor is based on replacing the sequences 

encoding for the glucocorticoid receptor transactivating domain with those belonging to the 

VP 16 protein of Herpex simplex. Thus PCR was used to generate three fragments all to be 

assembled to produce the chimeric receptor. The PCRs were carried out as described in 

Example II, iii. The first fragment includes the Kozak sequences and methionine start site of 
15 the glucocorticoid receptor to amino acid 152 of the glucocorticoid receptor." The 

oligonucleotides used for the generation of this fragment included an EcoRI site at the 5* end: 

GR1 A 5* atatgaattccaccatggactccaaagaatc 3' 

and at the 3' end a Nhel restriction enzyme recognition site : 

GR1B 5* atatgctagctgtgggggcagcagacacagcagtgg 3\ 
20 The second fragment also belongs to the glucocorticoid receptor and begins with a 

Nhel site in frame with amino acid 406 : 

GR2A 5'atatgctagctccagctcctcaacagcaacaac 3* 

and ends with a Xhol site at amino acid 500 : 

GR2B S'atatctcgagcaattccttttatttttttc 3'. 
25 The two fragments were introduced into pSKEcoRI/SacI in a ligation containing 

GR1A/B EcoRI/Nhel, GR2A/B Nhel/Xhol and HEcR Sall/SacI (from pSKHEcRDEF) to 

yield pSKGRDHEcR. The GR sequences and junctions of the ligation were found to be 

mutation free. 

The third fragment to be amplified was a sequence between amino acid 41 1 to 490 of 
30 the herpes simplex VP 16 protein. The amplified fragment was flanked with Spel recognition 
sites. Spel produces compatible ends to those of Nhel sites. The oligonucleotides used : 
VP16C 5' at tactagt tctgcggcccccccgaccgat 3' and 
VP16E 5* aattactagtcccaccgtactcgtcaattcc 3' 

produced a 180 bp fragment which was restriction enzyme digested with Spel and introduced 
35 into pSKGRAHEcR Nhel vector to produce pSKGRVP16HEcR. The DNA from the latter 
was sequenced and and found to be mutation free, the junctions were also shown to be in 
frame with those of the glucocorticoid receptor. 
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The 2.2 kb EcoRV/Notl GRVP16HEcR fragment was introduced into a pcDNA3 
EcoRV/NotI vector resulting in pcDNA3GRVP16HEcR (Figure 36). 

Construction of plant transient expressio n effector nb smids mnr,^ , he chim ^ r . , 

with VP 16 Qftfi^prpc 

The same procedure was carried out to clone the GRVP16HeCR DNA fragment into 
tobacco(pMF7b) and maize(pMF6) expression vectors. A 2.2kb BamHI DNA fragment was 
isolated from pcDNA3GRVP16HeCR and ligated in to the P MF6 BamHI (or P MF7b BamHI) 
vector to produce P MF6GRVP16HeCR (Figure 37) (or P MF7GRVP16HeCR) (Figure 38) 
Results - Addition of stmnp a ctivation d omai ns enhan™ ^Hysnne sw^ h .y.,.^ 

The VP16 transactivation domain from herpes simplex virus has been added to a 
maize protoplast vector pMF6GRHEcR to generate the vector pMF6GR VP 1 6HEcR When 
co-transformed into maize protoplasts with the reporter construct p221.9GRE6 in the 
presence of lOOuM RH5992, enhanced levels of expression were seen over pMFdGRHEcR 
Figure 39, shows that RH5992 is able to induce GUS levels comparable to those observed ' 
with the positive control ( P 35SCaMVGUS), moreover, a dose response effect is observable 

VP16 enhanced vectors (pES3 and pES4) have been generated for stable 
transformation of tobacco. Following transformation transgenic progeny containing P ES3 

- and pES4, gave elevated GUS levels following treatment" wim~RH5992, reTative to 

comparable transgenic plants containing the non-VP16 enhanced vectors pESl and pES2 

An enhanced mammalian vector pcDNA3GRVP16HEcR was prepared for transient 
transfection of mammalian cell lines. Elevated reporter gene activities were obtained relative 
to the effector construct (pCDNA3GRHEcR) without the VP16 addition. 

"Acidic" activation domains are apparently "universal" activators in eukaryotes 
(Ptashne (1988) Nature 335 683-689). Other suitable acidic activation domains which have 
been used in fusions are the activator regions of GAL4 itself (region I and region II; Ma and 
Ptashne (Cell (1987) 48, 847-853), the yeast activator GCN4 (Hope and Struhl (1986) Cell 
46, 885-894) and the herpes simplex virus VP16 protein (Triezenberg et al (1988) Genes 
Dev. 2, 718-729 and 730-742). 

Other acidic and non-acidic transcriptional enhancer sequences for example from plant 
fungal and mammalian species can be added to the chimeric ecdysone receptor to enhance 
induced levels of gene expression. 

Chimeric or synthetic activation domains can be generated to enhance induced levels 
of gene expression. 
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Example X - Optimisation by replacement of Heliothis ligand binding domain in 
chimeric effector for that of an ecdysone ligand binding domain of another species 

5 Mutagenesis of the ecdysone ligand binding domain results in the increased sensitivity 

of the chimeric receptor for activating chemical. This can be achieved by deletions in the 
ligand binding domain, use of error prone PCR (Caldwell et al., PCR Meth. Applic 2, 28-33 
1992), and in vitro DNA shuffling PCR (Stemmer, Nature 370, 389-391 1994). To enhance 
the efficacy of the listed techniques we have developed a screening system for enhanced levels 

10 of induced expression (see below). 

An alternative strategy to the mutation of a known ligand binding domain is to identify 
insect species which are particularly sensitive to ecdysteroid agonists. For example 
Spodoptera exigua is particularly sensitive to RH 5992. To investigate the role of the 
ecdysone receptor ligand binding domain in increased sensitivity to RH5992 we have isolated 

15 corresponding DNA sequences from of S. exigua (Figure 40, Sequence ID No. 6). Figure 41, 
Sequence ID No. 7 shows a protein alignment of the hinge and ligand binding domains of the 
Heliothis virescens and Spodoptera exigua ecdysone receptors. The protein sequence 
between the two species is well conserved. 

Results - Manipulation of the ligand binding domain leads to enhanced induced expression 
20 Isolation of an ecdysone ligand binding domain from another lepidopteran species was 

carried out by using degenerate oligonucleotides and PCR of first strand cDN A (Perkin 
Elmer, cDNA synthesis Kit) of the chosen species. The degenerate oligonucleotides at the 5' 
end were HingxhoA and B and at the 3' end ligandxA/B 

25 HingxhoA 5' attgctcgagaaagiccigagtgcgtigticc 3' 

a t 

HingxhoB 5 ' attgctcgagaacgiccigagtgtgtigticc 3 ' 

a c 



30 LigandxA 5' ttactcgagiacgtcccaiatctcttciaggaa 3' 

a tea 

ligandxB 5' ttactcgagiacgtcccaiatctcctciaagaa 3' 

a t t a 



35 



RNA was extracted from 4th instar larvae of Spodoptera exigua since Spodoptera 
exigua appears to be more sensitive to RH5992 than Heliothis (Smagghe and Degfaeele, 
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1994). The first strand cDNA was used in PCR reactions under the following conditions 
20mM Tris-HCL (pH8.4), 50mM KC1, 1.5mM MgCl,, 200mM dNTPs (dATP dCTP dGTP 
and dTTP) and 0.02 U/ml Tag DNA polymerase and in the presence of lug of each Hinge (5« 
3') and Ligand (5'3') oligonucleotides. The PCR cycling conditions were 94°C for 1 minute 
60°C for 2 minutes and 72'C for 1 minute and 35 cycles were carried out. A sample of the 
completed reaction was fractionated in a 1% agarose (w/v) 1 x TBE gel, and the resulting 900 
bp fragment was subcloned into pCRH vector (Invitrogen). The resulting clone (pSKSEcR 1- 
10) were further characterised and sequenced. 

Example X - Manipulation of reporter gene promoter regions can modulate chemical 
induced expression 
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The context of the effector response element in the reporter gene promoter can be 
used to modulate the basa! and induced levels of gene expression. Six copies of the 
glucorticoid response element were fused to 46 bp or 60 bp of the CaMV 35S promoter 
sequence. When used with the effector construct pMF7GRHEcRS the reporter gene 
construct containing 46 bp of the CaMV 35S promoter gave reduced basal and induced levels 

-OttJUSexpression relative to the 60 bp reporter construct (Figure"42). 

Constructs for plant transformation (pES 1 and ES2) have been generated to 
demonstrate the size of minimal promoter can be used to modulate the basal and induced 
levels of gene expression in plants. 

The number and spacing of response elements in the reporter gene promoter can be 
adjusted to enhance induced levels of trans-gene expression. 

The utility of a two component system (effector and reporter gene cassettes) allows 
the spaual control of induced expression. Trans-gene expression can be regulated in an tissue 
specific, organ specific or developmentally controlled manner. This can be achieved by 
driving the effector construct from a spatially or temporally regulated promoter. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ZENECA LIMITED 

(B) STREET: 15 STANHOPE GATE 

(C) CITY: LONDON 

(E) COUNTRY: UK 

(F) POSTAL CODE (ZIP) : WlY 6LN 



(ii) TITLE OF INVENTION: A GENE SWITCH 

15 (iii > NUMBER OF SEQUENCES: 7 

(iv) COMPUTER READABLE FORM- 

(A) MEDIUM TYPE: Floppy disk 
20 {*} COMPUTER: IBM PC compatible 

£ OPERATING SYSTEM: PC-DOS/MS-DOS 

S0p ™**= Patentin Release #1.0, Version #1.30 (EPO, 
(vi) PRIOR APPLICATION DATA: 
9 * fA) APPLICATION NUMBER: GB 9510759 S 

^ (B) FILING DATE: 26-MAY-1995 



(Vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9513882 T 
' (B) FILING DATE: 07-JUL-1995 _ 

(Vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9517316 7 

(B) FILING DATE: 24-AUG-1995 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9605656 9 
<B) FILING DATE: 18-MAR-1996 

(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 116 base pairs 
4S < B > TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA to mRNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Heliothis virescens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
TGCGAGGGGT GCAAGGAGTT CTTCAGGCGG AGTGTAACCA AAAATGCAGT GTACATATGC 60 
60 AAATTCGGCC ATGCTTGCGA AATGGATATG TATATGCGGA GAAAATGCCA AGAGTA 116 
(2) INFORMATION FOR SEQ ID NO: 2: 
U) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1934 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Heliothis virescens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE : pSK19R 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TCCACTGGTG TTTTCACCAC CACAGAAAAG GCCTCTGCTC ATTTAGAGGG TGGTGCTAAG 60 

AAGGTCATCA TCTCCTGCTG CCCAGCGCTG ACCCATGTTC GTCGTTGGTG TCAACCTTGA 120 

AGCAGTATGA CCCCTCTTAC AAGGTCATCT CCAACGCCTC CTGCACAACC AACTGCCTCG 180 

CTCCTCTCGC TAAGGTCATC CATGACAACT TCGAGATCAT TGAAGGTCTG ATGACCACTG 240 

TACACGCCAC CACTGCCACC CAGAAGACAG TGGATGGACC CTCTGGTAAA CTGTGGCGTG 300 

ATGGCCGTGG TGCTCAGCAG AATATCATTC CCGCGGAATT CCCCAGCCGC AGCTAGC TAA 360 

CCTGCAGCAG ACACAACCCC TACCTTCCAT GCCGTTACCA ATGCCACCGA CAACACCCAA 420 

ATCAGAAAAC GAGTCAATGT CATCAGGTCG TGAGGAACTG TCTCCAGCTT CGAGTGTAAA 480 

CGGCTGCAGC ACAGATGGCG AGGCGAGGCG GCAGAAGAAA GGCCCAGCGC CGAGGCAGCA 540 

AGAAGAGCTA TGTCTTGTCT GCGGCGACAG AGCCTCCGGA TATCACTACA ACGCGCTCAC 600 

ATGTGAAGGG TGTAAAGGTT TCTTCAGGCG GAGTGTAACC AAAAATGCAG TGTACATATG 660 

CAAATTCGGC CATGCTTGCG AAATGGATAT CTATATGCGG AGAAAATGTC AGGAGTGTCG 720 

GTTGAAGAAA TGTCTTGCGG TGGGCATGAG GCCCGAGTGC GTGGTGCCGG AGAACCAGTG 780 

TGCAATGAAA CGGAAAGAGA AAAAGGCGCA GAGGGAAAAA GACAAATTGC CCGTCAGTAC 840 

GACGACAGTA GACGATCACA TGCCTCCCAT CATGCAATGT GACCCTCCGC CCCCAGAGGC 900 

CGCTAGAATT CTGGAATGTG TGCAGCACGA GGTGGTGCCA CGATTCCTGA ATGAGAAGCT 960 

AATGGAACAG AACAGATTGA AGAACGTGCC CCCCCTCACT GCCAATCAGA AGTCGTTGAT 1020 

CGCAAGGCTC GTGTGGTACC AGGAAGGCTA TGAACAACCT TCCGAGGAAG ACCTGAAGAG 1080 

GGTTACACAG TCGGACGAGG ACGACGAAGA CTCGGATATG CCGTTCCGTC AGATTACCGA 1140 

GATGACGATT CTCACAGTGC AGCTCATCGT AGAATTCGCT AAGGGCCTCC CGGGCTTCGC 1200 

CAAGATCTCG CAGTCGGACC AGATCACGTT ATTAAAGGCG TGCTCAAGTG AGGTGATGAT 1260 

GCTCCGAGTG GCTCGGCGGT ATGACGCGGC CACCGACAGC GTACTGTTCG CGAACAACCA 1320 

GGCGTACACT CGCGACAACT ACCGCAAGGC AGGCATGGCG TACGTCATCG AGGACCTGCT 1380 
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25 
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GCACTTCTGT CGGTGCATGT ACTCCATGAT GATGGATAAC 
AGCCATTGTC ATCTTCTCAG ACCGGCCCGG GCTTGAGCAA 
CCAGAGATAT TACCTGAACA CGCTACGGGT GTACATCCTG 
CCGCGGCGCC GTCATCTTCG GCGAGATCCT GGGCATACTG 
CATGCAGAAC TCCAACATGT GCATCTCCCT CAAGCTGAAG 
CCTCGAGGAG ATCTGGGACG TGGCGGACGT GGCGACGACG 
GGCGCCGGCG CCTCTAGCCC CCGCCCCGCC CGCCCGGCCG 
CTCAGGAGAG AACGCTCATA GACTGGCTAG TTTTAGTGAA 
ACGTGATCAA CCTATTTATA AGGACTGCGA ATTTTACCAC 
CCGATTTCGT ACGG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Heliothis virescens 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: pSK16.1 



GTGCATTATG 
CCCCTGTTGG 
AACCAGAACA 
ACGGAGATCC 
AACAGGAAGC 
GCGACGCCGG 
CCCGCCACCG 
GTGCACGGAC 
TTAAGAGGGC 



CGCTGCTTAC 

TGGAGGACAT 

GCGCGTCGCC 

GCACGCTGGG 

TGCCGCCGTT 

TGGCGGCGGA 

TCTAGCGCGC 

ACTGACGTCG 

ACACCCGTAC 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1934 



40 



45 



50 



55 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
CGCTGGTATA ACAACGGACC ATTCCAGACG CTGCGAATGC 
GTGACGTCGT CTTCAGCACT GGGCCTGCCG CCGGCTATGG 
GCGTCGCCCG AGATCGGCGG CCTGGAGCTG TGGGGCTACG 
ATGGCACAGT CGCTGGGCAC CTGCACCATG GAGCAGCAGC 
CCGCAGCAGA CACAACCCCT ACCTTCCATG CCGTTACCAA 
TCAGAAAACG AGTCAATGTC ATCAGGTCGT GAGGAACTGT 
GGCTGCAGCA CAGATGGCGA GGCGAGGCGG CAGAAGAAAG 
GAAGAGCTAT GTCTTGTCTG CGGCGACAGA GCCTCCGGAT 
TGTGAAGGGT GTAAAGGTTT CTTCAGGCGG AGTGTAACCA 
AAATTCGGCC ATGCTTGCGA AATGGATATC TATATGCGGA 
TTGAAGAAAT GTCTTGCGGT GGGCATGAGG CCCGAGTGCG 
GCAATGAAAC GGAAAGAGAA AAAGGCGCAG AGGGAAAAAG 



TCGAGGAGAG 
TGATGTCCCC 
ACGATGGCAT 
AGCCCCAGCC 
TGCCACCGAC 
CTCCAGCTTC 
GCCCAGCGCC 
ATCACTACAA 
AAAATGCAGT 
GAAAATGTCA 
TGGTGCCGGA 
ACAAATTGCC 



CTCGTCTGAG 
GGAATCGCTC 
CACTTACAGC 
GCAGCAGCAG 
AACACCCAAA 
GAGTGTAAAC 
GAGGCAGCAA 
CGCGCTCACA 
GTACATATGC 
GGAGTGTCGG 
GAACCAGTGT 
CGTCAGTACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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ACGACAGTAG ACGATCACAT GCCTCCCATC ATGCAATGTG ACCCTCCGCC CCCAGAGGCC 780 

GCTAGAATTC TGGAATGTGT GC AGC AC GAG GTGGTGCCAC GATTCCTGAA TGAGAAGC T A 840 

5 

ATGGAACAGA ACAGATTGAA GAACGTGCCC CCCCTCACTG CCAATCAGAA GTCGTTGATC 900 

GCAAGGCTCG TGTGGTACCA GGAAGGCTAT GAACAACCTT CCGAGGAAGA CCTGAAGAGG 960 

10 GTTACACAGT CGGACGAGGA CGACGAAGAC TCGGATATGC CGTTCCGTCA GATTACCGAG 1020 

ATGACGATTC TCACAGTGCA GC TCATCGTA GAATTCGCTA AGGGCCTCCC GGGCTTCGCC 1080 

AAGATCTCGC AGTCGGACCA GATCACGTTA TTAAAGGCGT GCTCAAGTGA GGTGATGATG 1140 

15 

CTCCGAGTGG CTCGGCGGTA TGACGCGGCC ACCGACAGCG TACTGTTCGC GAACAACCAG 1200 

GCGTACACTC GCGACAACTA CCGCAAGGCA GGCATGGCGT ACGTCATCGA GGACCTGCTG 1260 

20 CACTTCTGTC GGTGCATGTA CTCCATGATG ATGGATAACG TGCATTATGC GCTGCTTACA 1320 

GCCATTGTCA TCTTCTCAGA CCGGCCCGGG CTTGAGCAAC CCCTGTTGGT GGAGGACATC 1380 

CAGAGATATT ACCTGAACAC GCTACGGGTG TACATCCTGA ACCAGAACAG CGCGTCGCCC 1440 

25 

CGCGGCGCCG TCATCTTCGG CGAGATCCTG GGCATACTGA CGGAGATCCG CACGCTGGGC 1500 

ATGCAGAACT CCAACATGTG CATCTCCCTC AAGCTGAAGA ACAGGAAGCT GCCGCCGTTC 1560 

30 CTCGAGGAGA TCTGGGACGT GGCGGACGTG GCGACGACGG CGACGCCGGT GGCGGCGGAG 1620 

GCGCCGGCGC CTCTAGCCCC CGCCCCGCCC GCCCGGCCGC CCGCCACCGT CTAGCGCGCC 1680 

TCAGGAGAGA ACGCTCATAG ACTGGCTAGT TTTAGTGAAG TGCACGGACA CTGACGTCGA 1740 

35 

CGTGATCAAC CTATTTATAA GGACTGCGAA TTTTACCACT TAAGAGGGCA CACCCGTACC 1800 

CGATTTCGTA CGTATTCGGT GACCGACGAC GATGCAGAGC GTGTGTAATG TGAATATATG 1860 

40 TGTTGTTGAA CGATTTGGAG AATATATATT GGTGTTGCTG TTCGGGCCCG CACGCCGTCG 1920 

CCGGTCGGCG GCGATCGCGG CGCCCGCGGC TTCAGTTTTA TTTCGTTTAC G AC TGAGTTG 1980 

GTCACTCGGA TACGACTGTA TGATAAGACT TCGTTCGATA AGTACACCTA CTAAATTACA 2040 

45 

CATACGTACG TAGCTTACGA GAGTTATTAG AGACAAAGAA TATAAGAAGA AG ATGTTTC T 2100 

ATTGGGTGAA AAGTTGATAG TTATGTTTAT TTACCAAAAT TAACAATAAT ACGTTGATTA 2160 

50 ACCTTTCGAG TATAATATTG TGATGAGTCG TCCGCTGTCC ACGTCGCCGT CACATGTTTG 2220 

TTTCTGATGC ACACGTGAGG NGCGTTATCG TGTTTCATGG TTCCATCGTC CTGTGCCCGC 2280 

GACCCTCGAC TAAATGAGTA ATTTAATTTA TTGCTGTGAT TACATTTTAA TGTGTTGATT 2340 

55 

ATCTACCATA GGGTGATATA AGTGTGTCTT ATTACAATAC AAAGTGTGTG TCGTCGATAG 2400 

CTTCCACACG AGCAAGCCTT TTGTTTAAGT GATTTACTGA CATGGACACT CGACCCGGAA 2460 

60 CTTC 2464 
(2) INFORMATION FOR SEQ ID NO: 4: 
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U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 225 . .1955 

<D) OTHER INFORMATION: /codon_start= 225 

/product^ "Heliothis ecdysone receptor" 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ACTCGCGTGC TCTTCTCACC TGTTGCTCGG ATTGTGTTGT 
GCTCGAACGA GACTTCCGAG TCCTATTGGA TTGCACGAAA 
TTCGGTTTCG TTTGAACGTT GCGTAGACGA GTGGTGCATG 
AGTTTAGTGC GAGGAAAAAG TGAAGTGAAA GCCTTCCTCG 
GTGGATACCG GAGGTGTGAC ACGCTCGCCG ACATGAGACG 

^0 CATTGGAGAC - GCTGCGAATG CTCGJK^GA _ GCTCGTCTGA" 

TGGGCCTGCC GCCGGCTATG GTGATGTCCC CGGAATCGCT 
GCCTGGAGCT GTGGGGCTAC GACGATGGCA TCACTTACAG 
CCTGCACCAT GGAGCAGCAG CAGCCCCAGC CGCAGCAGCA 
TACCTTCCAT GCCGTTACCA ATGCCACCGA CAACACCCAA 
CATCAGGTCG TGAGGAACTG TCTCCAGCTT CGAGTGTAAA 
AGGCGAGGCG GCAGAAGAAA GGCCCAGCGC CGAGGCAGCA 
GCGGCGACAG AGCCTCCGGA TATCACTACA ACGCGCTCAC 
TCTTCAGGCG GAGTGTAACC AAAAATGCAG TGTACATATG 
AAATGGATAT CTATATGCGG AGAAAATGTC AGGAGTGTCG 
TGGGCATGAG GCCCGAGTGC GTGGTGCCGG AGAACCAGTG 
AAAAGGCGCA GAGGGAAAAA GACAAATTGC CCGTCAGTAC 
TGCCTCCCAT CATGCAATGT GACCCTCCGC CCCCAGAGGC 
TGCAGCACGA GGTGGTGCCA CGATTCCTGA ATGAGAAGCT 
AGAACGTGCC CCCCCTCACT GCCAATCAGA AGTCGTTGAT 
AGGAAGGCTA TGAACAACCT TCCGAGGAAG ACCTGAAGAG 
ACGACGAAGA CTCGGATATG CCGTTCCGTC AGATTACCGA 



ACTAGAAAAA 
GTCGAGACAG 
TCCATGAGTC 
GAGGATGTCC 
CCGCTGGTAT 

"ggtgacgtcg 

CGCGTCGCCC 
CATGGCACAG 
GCCGCAGCAG 
ATCAGAAAAC 
CGGCTGCAGC 
AGAAGAGCTA 
ATGTGAAGGG 
CAAATTCGGC 
GTTGAAGAAA 
TGCAATGAAA 
GACGACAGTA 
CGCTAGAATT 
AATGGAACAG 
CGCAAGGCTC 
GGTTACACAG 
GATGAC GATT 



AGTTGTCGCC 
TGGATAGCGA 
GCGTTTAGAT 
CTCGGCGCTC 
AACAACGGAC 



TCTTCAGCAC 
GAGATCGGCG 
TCGCTGGGCA 
ACACAACCCC 
GAGTCAATGT 
ACAGATGGCG 
TGTCTTGTCT 
TGTAAAGGTT 
CATGCTTGCG 
TGTCTTGCGG 
CGGAAAGAGA 
GACGATCACA 
CTGGAATGTG 
AACAGATTGA 
GTGTGGTACC 
TCGGACGAGG 
CTCACAGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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AGCTCATCGT AGAATTCGCT AAGGGCCTCC CGGGCTTCGC CAAGATCTCG CAGTCGGACC 1380 
AGATCACGTT ATTAAAGGCG TGCTCAAGTG AGGTGATGAT GCTCCGAGTG GCTCGGCGGT 1440 
5 ATGACGCGGC CACCGACAGC GTACTGTTCG CGAACAACCA GGCGTACACT CGCGACAACT 1500 



ACCGCAAGGC AGGCATGGCG TACGTCATCG AGGACCTGCT GCACTTCTGT CGGTGCATGT 1560 

ACTCCATGAT GATGGATAAC GTGCATTATG CGCTGCTTAC AGCCATTGTC ATCTTCTCAG 1620 

10 

ACCGGCCCGG GCTTGAGCAA CCCCTGTTGG TGGAGGAGAT CCAGAGATAT TACCTGAACA 1680 

CGCTACGGGT GTACATCCTG AACCAGAACA GCGCGTCGCC CCGCGGCGCC GTCATCTTCG 1740 

15 GCGAGATCCT GGGCATACTG ACGGAGATCC GC ACGCTGGG CATGCAGAAC TCCAACATGT 1800 

GCATCTCCCT CAAGCTGAAG AACAGGAAGC TGCCGCCGTT CCTCGAGGAG ATCTGGGACG 1860 



TGGCGGACGT GGCGACGACG GCGACGCCGG TGGCGGCGGA GGCGCCGGCG CCTCTAGCCC 1920 

20 

CCGCCCCGCC CGCCCGGCCG CCCGCCACCG TCTAGCGCGC CTCAGGAGAG AACGCTCATA 1980 
GACTGGCTAG TTTTAGTGAA GTGCACGGAC ACTGACGTCG ACGTGATCAA CCTATTTATA 2040 
25 AGGACTGCGA ATTTTACCAC TTAAGAGGGC ACACCCGTAC CCGATTTCGT ACGTATTCGG 2100 
TGAC CGACG A CGATGCAGAG CGTGTGTAAT GTGAATATAT GTGTTGTTGA ACGATTTGGA 2160 
GAATATATAT TGGTGTTGCT GTTCGGGCCC GCACGCCGTC GCCGGTCGGC GGCGATCGCG 2220 

30 

GCGCCCGCGG CTTCAGTTTT ATTTCGTTTA CGACTGAGTT GGTCACTCGG ATACGACTGT 2280 
ATGATAAGAC TTCGTTCGAT AAGTACACCT ACTAAATTAC ACATACGTAC GTAGCTTACG 2340 
35 AGAGTTATTA GAGACAAAGA ATATAAGAAG AAGATGTTTC TATTGGGTGA AAAGTTGATA 2400 
GTTATGTTTA TTTACCAAAA TTAACAATAA TACGTTGATT AACCTTTCGA GTATAATATT 2460 
GTGATGAGTC GTCCGCTGTC CACGTCGCCG TCACATGTTT GTTTCTGATG CACACGTGAG 2520 

40 

GNGCGTTATC GTGTTTCATG GTTCCATCGT CCTGTGCCCG CGACCCTCGA CTAAATGAGT 2580 
AATTTAATTT ATTGCTGTGA TTACATTTTA ATGTGTTGAT TATCTACCAT AGGGTGATAT 2640 
45 AAGTGTGTCT TATTACAATA CAAAGTGTGT GTCGTCGATA GCTTCCACAC GAGCAAGCCT 2700 
TTTGTTTAAG TGATTTACTG ACATGGACAC TCGACCCGGA ACTTC 2745 
(2) INFORMATION FOR SEQ ID NO: 5: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



WO 96/37609 



-46- 



PCT/GB96/01195 



10 



20 



25 



35 



40 



50 



55 



Met Ser Leu Gly Ala Arg Gly Tyr Arg Arg Cys Asp Thr Leu Ala Asp 



10 15 



Met Arg Arg Arg Trp Tyr Asn Asn Gly Gly Phe Gin Thr Leu Arg Met 

2 5 



30 



Leu Glu Glu Ser Ser Ser Glu Val Thr Ser Ser Ser Ala Leu Gly Leu 

" 40 45 

Pro pro Ala Met Val Met Ser Pro Glu Ser Leu Ala Ser Pro Glu lie 
3U 55 60 

Gly Gly Leu Glu Leu Trp Gly Tyr Asp Asp Gly He Thr Tyr Ser Met 

15 75 80 

Ala Gin Ser Leu Gly Thr Cys Thr Met Glu Gin Gin Gin Pro Gin Pro 

85 90 95 

Gin Gin Gin Pro Gin Gin Thr Gin Pro Leu Pro Ser Met Pro Leu Pro 
100 105 110 

Met Pro Pro Thr Thr Pro Lys Ser Glu Asn Glu Ser Met Ser Ser Gly 
±xs 120 125 

Arg Glu Glu Leu Ser Pro Ala Ser Ser Val Asn Gly Cys Ser Thr Asp 

135 140 

Gly Glu Ala Arg Arg Gin Lys Lys Gly Pro Ala Pro Arg Gin Gin Glu 

.30 1-55 160- 

Glu Leu Cys Leu Val Cys Gly Asp Arg Ala Ser Gly Tyr His Tyr Asn 



170 175 



Ala Leu Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Val Thr 

185 190 

Lys Asn Ala Val Tyr lie Cys Lys Phe Gly His Ala Cys Glu Met Asp 

4 vU 



205 



He Tyr Met Arg Arg Lys Cys Gin Glu Cys Arg Leu Lys Lys Cys Leu 

Ala Val Gly Met Arg Pro Glu Cys Val Val Pro Glu Asn Gin Cys Ala 

45 JU 235 240 

Met Lys Arg Lys Glu Lys Lys Ala Gin Arg Glu Lys Asp Lys Leu Pro 
245 250 255 



Val Ser Thr Thr Thr Val Asp Asp His Met Pro Pro He Met Gin Cys 

265 270 
Asp Pro Pro Pro Pro Glu Ala Ala Arg He Leu Glu Cys val Gin His 



285 



Glu Val Val Pro Arg Phe Leu Asn Glu Lys Leu Met Glu Gin Asn Arg 

295 300 

Leu Lys Asn Val Pro Pro Leu Thr Ala Asn Gin Lys Ser Leu He Ala 
60 J1 ° 3X5 320 

Arg Leu Val Trp Tyr Gin Glu Gly Tyr Glu Gin Pro Ser Glu Glu Asp 
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CAAAGGGAAA AAGACAAGTT GCCAGTCAGT ACAACGACAG TGGATGATCA CATGCCTCCC 
5 ATTATGCAGT GTGATCCACC GCCTCCAGAG GCCGCAAGAA TTCACGAGGT GGTGCCACGA 
TTCCTGAATG AAAAGCTAAT GGACAGGACA AGGCTCAAGA ATGTGCCCCC TCACTGCCAA 
CCAGAAGTCC TTAATAGCGA GGCTGGTCTG GTACCAAGAA GGCTATGAAC AGCCATCAGA 
10 AGAGGATCTA AAAAGAGTCA CACAGTCGGA TGAAGACGAA GAAGAGTCGG ACATGCCGTT 
CCGTCAGATC ACCGAGATGA CGATCCTCAC AGTGCAGCTC ATTGTTGAAT TCGCTAAGGG 
^ CCTACCAGCG TTCGCAAAGA TCTCACAGTC GGATCAGATC ACATTATTAA AGGCCTGTTC 
GAGTGAGGTG ATGATGTTGC GAGTAGCTCG GCGGTACGAC GCGGCGACAG ACAGCGTGTT 
GTTCGCCAAC AACCAGGCGT ACACCCGCGA CAACTACCGC AAGGCAGGCA TGGCCTACGT 
CATCGAGGAC CTGCTGCACT TCTGCCGGTG CATGTACTCC ATGATGATGG ATAACGTCCA 
CTATGCACTG CTCACTGCCA TCGTCATTTT CTCAGACCGA CCCGGGCTTG AGCTAACCCT 
^ GTTGGTGGAG GAGATCCAGA GATATTACCT GAACACGCTG CGGGTGTACA TCCTGAACCA 
GAACAGTCGG TCGCCGTGCT GCCCTGTCAT CTACGCTAAG ATCCTCGGCA TCCTGACGGA 
GCTGCGGACC CTGGGCATGC AGAACTCCAA CATGTGCATC TCACTCAAGC TGAAGAACAG 

-30- -GAAGGTGCCG-CCGTTCTTCG "AGGATATCTG - GGACGTCCTC GAGTAAAA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY" : linear 



20 



40 



(ii) MOLECULE TYPE: protein 



45 < x i> SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Arg Pro Glu Cys Val Val Pro Glu Asn Gin Cys Ala Met Lys Arg Lys 

50 Glu Lys Lys Ala Gin Arg Glu Lys Asp Lys Leu Pro Val Ser Thr Thr 

20 25 30 

Thr Val Asp Asp His Met Pro Pro lie Met Gin Cys Asp Pro Pro Pro 
55 5 40 « 

Pro Glu Ala Ala Arg lie Leu Glu Cys Val Gin His Glu Val Val Pro 
DO 55 60 

a Arg Phe Leu Asn Glu Lys Leu Met Glu Gin Asn Arg Leu Lys Asn Val 

75 80 
Pro Pro Leu Thr Ala Asn Gin Lys Ser Leu He Ala Arg Leu Val Trp 
85 90 95 
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Tyr Gin Glu Gly Tyr Glu Gin Pro Ser Glu Glu Asp Leu Lys Arg Val 
100 105 110 

Thr Gin Ser Asp Glu Asp Asp Glu Asp Ser Asp Met Pro Phe Arg Gin 
115 120 125 

lie Thr Glu Met Thr lie Leu Thr Val Gin Leu lie Val Glu Phe Ala 
130 135 140 

Lys Gly Leu Pro Gly Phe Ala Lys lie Ser Gin Ser Asp Gin lie Thr 
145 150 155 160 

Leu Leu Lys Ala Cys Ser Ser Glu Val Met Met Leu Arg Val Ala Arg 
165 170 175 

Arg Tyr Asp Ala Ala Thr Asp Ser Val Leu Phe Ala Asn Asn Gin Ala 
180 185 190 

Tyr Thr Arg Asp Asn Tyr Arg Lys Ala Gly Met Ala Tyr Val He Glu 
195 200 205 

Asp Leu Leu His Phe Cys Arg Cys Met Tyr Ser Met Met Met Asp Asn 
210 215 220 

Val His Tyr Ala Leu Leu Thr Ala He Val He Phe Ser Asp Arg Pro 
225 230 235 240 

Gly Leu Glu Gin Pro Leu Leu Val Glu Glu He Gin Arg Tyr Tyr Leu 
245 250 255 

Asn Thr Leu Arg Val Tyr He Leu Asn Gin Asn Ser Ala Ser Pro Arg 
260 265 270 

Gly Ala Val He Phe Gly Glu He Leu Gly He Leu Thr Glu He Arg 
275 280 285 

Thr Leu Gly Met Gin Asn Ser Asn Met Cys He Ser Leu Lys Leu Lys 
290 295 300 

Lys Arg Lys Leu Pro Pro Phe Leu Glu Glu He Asp Trp Asp Val 
305 310 315 
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CLAIMS 

DNA comprising the sequence shown in Seq ID No. 2. 
DNA comprising the sequence shown in Seq ID No. 3. 
DNA comprising the sequence shown in Seq ID No. 4. 

DNA comprising a sequence which shows 60% or more homology with the sequence 
shown in Seq ID No 1, 2 or 3. 

DNA according to claim 4 wherein said homology is in the range of 65% to 99%. 

DNA which hybridises to the sequence shown in Seq. ID No. 2, 3 or 4, and which 
codes for at least part of the Heliothis ecdysone receptor. 

DNA which is degenerate as a result of the genetic code to the DNA of any one of 
-claims-l-to 6-and which codes for a polypeptide ^hlch is aFle^st part' of the HelTothiT 
ecdysone receptor. 

DNA comprising part of the sequence shown in Seq ID No. 2, and which codes for at 
least part of the Heliothis ecdysone receptor ligand binding domain. 

DNA comprising part of the sequence shown in Seq ID No. 3, and which codes for at 
least part of the Heliothis ecdysone receptor ligand binding domain. 

DNA comprising part of the sequence shown in Seq ID No. 4, and which codes for at 
least part of the Heliothis ecdysone receptor ligand binding domain. 

DNA comprising a sequence which shows 60% or more homology with the sequence 
of claim 8, 9 or 10. 

DNA according to claim 1 1 wherein said homology is in the range of 65% to 99%. 

DNA which hybridises to the DNA of any one of claims 8 to 12 and which codes for 
at least pan of the Heliothis ecdysone receptor ligand binding domain. 



WO 96/37609 PCT/GB96/01195 

14. DNA which is degenerate as a result of the genetic code to the DNA of any one of 

claims 8 to 12 and which codes for a polypeptide which is at least part of the Heliothis 
ecdysone receptor ligand binding domain. 

5 15. DNA comprising part of the sequence shown in Seq ID No. 2, and which codes for at 
least part of the Heliothis ecdysone receptor DNA binding domain. 

16. DNA comprising part of the sequence shown in Seq ID No. 3, and which codes for at 
least part of the Heliothis ecdysone receptor DNA binding domain. 

10 

17. DNA comprising part of the sequence shown in Seq ID No. 4, and which codes for at 
least part of the Heliothis ecdysone receptor DNA binding domain. 

18. DNA comprising a sequence which shows 60% or more homology with the sequence 
15 of claim 15, 16 or 17. 

19. DNA according to claim 18 wherein said homology is in the range of 65% to 99%. 

20. DNA which hybridises to the DNA of any one of claims 15 to 19 and which codes for 
20 at least part of the Heliothis ecdysone receptor DNA binding domain. 

21 . DNA which is degenerate as a result of the genetic code to the DNA of any one of 
claims 15 to 19 and which codes for a polypeptide which is at least part of the 
Heliothis ecdysone receptor DNA binding domain. 

25 

22. DNA comprising part of the sequence shown in Seq ID No. 2, and which codes for at 
least part of the Heliothis ecdysone receptor transactivation domain. 

23. DNA comprising part of the sequence shown in Seq ID No. 3, and which codes for at 
30 least part of the Heliothis ecdysone receptor transactivation domain. 

24. DNA comprising part of the sequence shown in Seq ID No. 4, and which codes for at 
least part of the Heliothis ecdysone receptor transactivation domain. 

35 25. DNA comprising a sequence which shows 60% or more homology with the sequence 
of claim 22, 23 or 24. 
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26. DNA according to claim 25 wherein said homology is in the range of 65% to 99%. 



27. 



DNA which hybridises to the DNA of any one of claims 22 to 26 and which codes for 
at least part of the Heliothis ecdysone receptor transaction domain. 



28. DNA which is degenerate as a result of the genetic code to the DNA of any one of 
claims 22 to 26 and which codes for a polypeptide which is at least part of the 
Heliothis ecdysone receptor transactivation domain. 



10 29. 



DNA comprising part of the sequence shown in Seq ID No. 2, and which codes for at 
least part of the Heliothis ecdysone receptor hinge domain. 



15 



30. DNA comprising part of the sequence shown in Seq ID No. 3, and which codes for a, 
least part of the Heliothis ecdysone receptor hinge domain. 

31. DNA comprising part of the sequence shown in Seq ID No. 4, and which codes for at 
least part of the Heliothis ecdysone receptor hinge domain. 

32. DNA comprising a sequence which shows 60% or more homology with the sequence 
20 of claim 29, 30 or 31. 

33. DNA according to claim 32 wherein said homology is in the range of 65% to 99%. 

34. DNA which hybridises to the DNA of any one of claims 29 to 33 and which codes for 
at least part of the Heliothis ecdysone receptor hinge domain. 



25 



35. 



30 



35 



DNA which is degenerate as a result of the genetic code of the DNA of any one of 
clatms 29 to 33 and which codes for a polypeptide which is at least part of the 
Heliothis ecdysone receptor hinge domain. 

36. DNA having part of the sequence shown in Seq ID No. 2, and which codes for at least 
part of the Heliothis ecdysone receptor carboxy terminal region. 

37. DNA having part of the sequence shown in Seq ID No. 3, and which codes for at least 
part of the Heliothis ecdysone receptor carboxy terminal region. 
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40. 
41. 

10 

42. 

15 

43. 

20 44. 
45. 

25 

46. 

30 

47. 
35 48. 



DNA having part of the sequence shown in Seq ID No. 4, and which codes for at least 
part of the Helibthis ecdysone receptor carboxy terminal region. 

DNA comprising a sequence which shows 60% or more homology with the sequence 
of claim 36, 37 or 38. 

DNA according to claim 39 wherein said homology is in the range of 65% to 99%. 

DNA which hybridises to the DNA of any one of claims 36 to 40 and which codes for 
at least part of the Heliothis ecdysone receptor carboxy terminal region. 

DNA which is degenerate as a result of the genetic code of the DNA of any one of 
claims 36 to 40 and which codes for a polypeptide which is at least part of the 
Heliothis ecdysone receptor carboxy terminal region. 

A polypeptide comprising the Heliothis ecdysone receptor or a fragment thereof, 
wherein said polypeptide is substantially free from other proteins with which it is 
ordinarily associated, and which is coded for by the DNA of any preceding claim. 



A polypeptide comprising the amino acid sequence shown in Seq ID No. 4 or any 
allelic variant or derivative thereof. 

A polypeptide comprising part of the amino acid sequence shown in Seq ID No. 4 or 
any allelic variant or derivative thereof, which sequence provides the Heliothis 
ecdysone receptor ligand binding domain. 

A polypeptide comprising part of the amino acid sequence shown in Seq ID No. 4 or 
any allelic variant or derivative thereof, which sequence provides the Heliothis 
ecdysone receptor DNA binding domain. 

A polypeptide comprising part of the amino acid sequence shown in Seq ID No. 4 or 
any allelic variant or derivative thereof, which sequence provides the Heliothis 
ecdysone receptor transactivation domain. 

A polypeptide comprising part of the amino acid sequence shown in Seq ID No. 4 or 
any allelic variant or derivative thereof, which sequence provides the Heliothis 
ecdysone receptor hinge domain. 
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49. A polypeptide comprising part of the amino acid sequence shown in Seq ID No. 4 or 
any allelic variant or derivative thereof, which sequence provides the Heliothis 
ecdysone receptor carboxy terminal region. 



5 



50. 



A polypeptide according to any one of claims 44 to 49 wherein said derivative is 
homologous variant which includes conservative amino acid changes. 



10 



51. DNA 



52. DNA 



comprising the sequence shown in Seq ID No. 6. 



15 



compnsmg a sequence which shows 60% or more homology with the sequence 
shown in Seq ID No. 6. 

53. DNA according to claim 52 wherein said homology is in the range of 65% to 99%. 

54. DNA which hybridises to the DNA sequence shown in Seq ID No. 6 and which codes 
for at least part of Spodoptera ecdysone receptor. 

55. DNA which is degenerate as a result of the genetic code to the DNA of any one of 
20 claims 51 to 54. 

56. DNA comprising part of the sequence shown in Seq ID No. 6, and which codes for at 
least part of the Spodoptera ecdysone receptor ligand binding domain. 

25 57. DNA comprising a sequence which shows 60% or more homology with the sequence 
of claim 56. 

58. DNA according to claim 57 wherein said homology is in the range of 65% to 99%. 



30 59. 



DNA which hybridises to the DNA of any one of claims 56 to 58 and which codes for 
at least part of the Spodoptera ecdysone receptor ligand binding domain. 



60. DNA which is degenerate as a result of the genetic code to the DNA of any one of 

claims 56 to 58 and which codes for at least part of the Spodoptera ecdysone receptor 
35 iigand binding domain. 
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DNA comprising part of the sequence shown in Seq ID No. 6, and which codes for at 
least part of the Spodoptera ecdysone receptor hinge domain. 

DNA comprising a sequence which shows 60% or more homology with the sequence 
of claim 61. 

DNA according to claim 62 wherein said homology is in the range of 65% to 99%. 

DNA which hybridises to the DNA of any one of claims 61 to 63 and which codes for 
at least part of the Spodoptera ecdysone receptor hinge domain. 

DNA which is degenerate as a result of the genetic code to the DNA of any one of 
claims 61 to 63 and which codes for at least part of the Spodoptera ecdysone receptor 
hinge domain. <. 

A polypeptide coded for by the DNA of any one of claims 5 1 to 65. 

A fusion polypeptide comprising the polypeptide of claim 45 or 50 (when dependent 
upon claim 45) and functionally linked to a DNA binding domain and a transactivation 
domain. 

Recombinant DNA comprising the DNA of any one of claim 8 to 14 functionally 
linked to DNA encoding a DNA binding domain and a transactivation domain. 

A fusion polypeptide according to claim 67 or recombinant DNA according to claim 
68 wherein the DNA binding domain and/or transactivation domain is fungal, 
bacterial, plant or mammalian. 

A fusion polypeptide or recombinant DNA according to claim 69 wherein the DNA 
binding domain is GAL4 or A1CR/A. 

A fusion polypeptide or recombinant DNA according to claim 69 or 70 wherein the 
transactivation domain is VP16. 

A fusion polypeptide or recombinant DNA according to claim 69 wherein the DNA 
binding domain and/or transactivation domain is from a steroid receptor superfamily 
member. 
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A fusior, .polypeptide or recombinant DNA according to claim 72 wherein the DNA 
mdmg domam and/or transaction domain is from , glucocorticoid or a 

bpodoptera ecdysone receptor. 

A recombinant DNA construct comprising recombinant DNA of any one of claims 68 
to 73; and DNA which codes for a gene operablv linw* , 

kn™ 8 °P eraD1 y «nked to a promoter sequence and a 

A fusion polypeptide comprising me polypeptide of claim 46 or 50 (when dependent 
upon cla.m 46) and functionally linlced to a ligand binding domain aid a 
transactrvation domain. 

To DNA binant H DNA COmPriSi " 8 ^ * "* ° f CjaimS 15 to 21 Anally linfced 
to DNA encodong a hgand binding domain and a transaction domain. 

Afeionpo^ 

76 whenem the hgand binding domain and/or transaction domain is fung* 

bacterial, plant or mammalian. S 

Atofao polypeptide or recombinant DNA according to claim 77 wherein the 

transactivation domain is VP 1 6. 

A fusion polypeptide or recombinant DNA according to claim 77 wherein the ligand 
b^gdomam and/or transactivation domain is from a steroid receptor su^y 

bin^T^f ° r reC ° mbinant DNA —ding to claim 79 wherein the ligand 
^ysone r^ptoT tranSaCtiVati ° n domain is fr °m a glucocorticoid or Spo<k>pera 

t ri b DN A T C ° mPriSin8 ° NA 0f ■* one of claims 76 

holne ^ COdCS 3 geDe B » ted to a I—— -quence and a 
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85. 
15 86. 

87. 

20 

88. 

25 

89. 

30 



A fusion polypeptide comprising the polypeptide of claim 47 or 50 (when dependent 
upon claim 47) and functionally linked to a ligand binding domain and a DNA binding 
domain. 

Recombinant DNA comprising the DNA of any one of claims 22 to 28 functionally 
linked to DNA encoding a ligand binding domain and a DNA binding domain. 

A fusion polypeptide according to claim 82 or recombinant DNA according to claim 
83 wherein the ligand binding domain and/or DNA binding domain is fungal, bacterial, 
plant or mammalian. 

A fusion polypeptide or recombinant DNA according to claim 84 wherein the DNA 
binding domain is GALA or A ICR/A. 

A fusion polypeptide or recombinant DNA according to claim 84 wherein the ligand 
binding domain and/or DNA binding domain is from a steroid receptor superfamily 
member. 

A fusion polypeptide or recombinant DNA according to claim 86 wherein the ligand 
binding domain and/or DNA binding domain is from a glucocorticoid or Spodoptera 
ecdysone receptor. 

A recombinant DNA construct comprising recombinant DNA of any one of claims 82 
to 87; and DNA which codes for a gene operably linked to a promoter sequence and a 
hormone response element, which is responsive to the DNA binding domain coded for 
by said recombinant DNA. 

A recombinant DNA construct comprising DNA according to any one of claims 1 to 
7; and DNA comprising a sequence which codes for a gene operably linked to a 
promoter sequence and at least one hormone response element which is responsive to 
the DNA binding domain coded for by said DNA of any one of claim 1 to 7. 

A recombinant DNA construct according to any one of claims 74, 81, 88 and 89 
wherein said promoter sequence codes for a constitutive, spatially or temporally 
regulating promoter. 
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91. 



92. 



A recombinant DNA construct according to any one of claims 74, 81, 88 and 89 
wherem there is more than one copy of the hormone response element. 

A cell transformed with the DNA of any one of claims 1 to 42, and 51 to 65- the 

67 70 to 73. 75. 77 to 80, 82 and 84 to 87; the recombinant nucleic acid of any one of 



10 93. 
94. 

15 95. 



or mammalian cell. 



25 98. 



A cell according to claim 92 wherein said cell is a plant, fungal 
01^^88" C ° mPriSing ~ ° f " ~ - 

sl^T td T g COmP ° UndS CaPabk ° f b6ing l ° M insect « — ptor 

superfannly member comprising seining compounds for binding to said polypeptide 

^ m 'Xf t08G ' 82and8r ^^ 

96. A compound selected using the method of claim 95. 

97. An agriculture or pharmaceutical composition comprising the compound of claim 96. 
Use of the compound of claim 96 as an agrochemical or a pharmaceutical. 

"' L"^cl Pr t Cin8 8 Pr °* in ' ^ ° r COn * risin * ^-ing into 

the cel, of chum 92, a compound which binds to the ligand binding domain in said cell. 
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